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. . . Abstract 

We study the classical rumor spreading problem, which is used to spread information in 
^^ , an unknown network with n nodes. We present the first protocol for any expander graph 

tN ' G with n nodes and minimum degree Q{n) such that, the protocol informs every node 

5-H , in O(logn) rounds with high probability, and uses O (log n log log n) random bits in total. 

The runtime of our protocol is tight, and the randomness requirement of (9(lognloglogn) 
random bits almost matches the lower bound of f2(logn) random bits. We further study 
rumor spreading protocols for more general graphs, and for several graph topologies our 
protocols are as fast as the classical protocol and use O(logn) random bits in total, in 
contrast to 0(71 log n) random bits used in the well-known rumor spreading push protocol. 
These results together give us almost full understanding of the randomness requirement for 
this basic epidemic process. 
^ I Our protocols rely on a novel reduction between rumor spreading processes and branching 

programs, and this reduction provides a general framework to derandomize these complex 
and distributed epidemic processes. Interestingly, one cannot simply apply PRGs for branch- 
ing programs as rumor spreading process is not characterized by small-space computation. 
Our protocols require the composition of several pseudorandom objects, e.g. pseudoran- 
i/-N ' dom generators, and pairwise independent generators. Besides designing rumor spreading 

CO i protocols, the techniques developed here may have applications in studying the randomness 

complexity of distributed algorithms. 

f^ , Keywords: distributed computing, rumor spreading, randomness complexity, branching 

CO ' programs, pseudorandomness 
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1 Introduction 



C^ , Rumor spreading is one of the most important communication primitives in large networks, and 

has been studied under different names such as gossip, information dissemination, or broadcast- 
ing. Efficient protocols for information spreading have wide applications in failure detection [35| , 



resource discovery [28[] , replicated database systems [ll|, ll8|] , and modeling the spread of com- 
puter viruses [J]. Besides computer science, the dynamics of such processes in social networks 
also constitutes a research topic in economics and sociology. 

The simplest and widely studied form of information spreading protocols is the so-called 
push model of rumor spreading. Initially, a message, called a rumor, is placed on an arbitrary 
node of an unknown network with n nodes. In subsequent synchronous rounds, every node 
that knows the rumor picks a neighbor uniformly at random and sends the rumor to the chosen 
neighbor. This process continues until every node gets the rumor. It was shown that this 
simple protocol is very efficient for several network topologies ISlJlQ. Il8l.l21|. In particular, its 
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runtime, the number of rounds required until every node gets the rumor with high probabihty, 
is logarithmic in the number of nodes in the graph. Graphs satisfying this property range from 
complete graphs, hypercubes, Erdos-Reyi random graphs, and "quasi-regular" expanders (i.e., 
expander graphs for which the ratio between the maximum and minimum degree is constant). 
Besides the efficiency, this protocol is local (i.e. no knowledge of global graph structure is 



needed), simple (e.g. nodes do not have initial IDs, in contrast to the protocol in J27l]), and 
can tolerate the failure of some links. More recently, several variations of information spreading 
protocols were proposed to allow information to spread efficiently on arbitrary networks [8|, or 
networks with weak expansion properties [7}]. 

Most of these algorithms are inherently randomized in both their design and analysis in that 
they crucially rely on the effect of choosing neighbors randomly for every node among different 
rounds. However, it is not clear if this randomization is essential for efficiently disseminating 
the information. Hence the randomness requirement, the number of random bits used in total in 
order to spread the rumor efficiently, becomes a key measurement for designing rumor spreading 
protocols. One of the most studied questions concerns the randomness requirement: how many 
random bits are sufficient to efficiently spread a rumor to all nodes in a graph? While for any 
graph with n nodes, the above-mentioned fully-random push protocol that finishes in T rounds 
needs 0{T ■ nlogn) random bits, it is not difficult to show that for any graph G of n nodes, 
there is a protocol which uses 3 log n random bits in total, and whose runtime is as fast as the 
standard fully-random protocol (See Corollarv IB. 21 for the formal statement). However, the 
explicit construction of such protocols is much more difficult, and a long line of research has 
been devoted to finding randomness-efficient protocols, see 12, ll2Ml4l. l23l . l24l . |27[ for instance. 



Prior to work, the best results need to use f^(log n) random bits, as the randomness used by 
different rounds have to be independent, and ri(logn) random bits are necessary per round. 

Our Results. The present paper constructs randomness-efficient rumor spreading protocols. 
Our main results are as follows: 

• Let G be an expander graph with n nodes and minimum degree 6 = Q{n). Then there is a 
protocol using O(lognloglogn) random bits in total, such that every node gets informed 
in O(logn) rounds with high probability (cf. Corollary [Sj 



• Let G be a general graph with n nodes, conductance (p and the ratio between maximum 
and minimum degree A/5 = 0(1). Then there is a protocol using 0((l/(/)) log n(log log n+ 
log A)) random bits in total, such that every node gets informed in 0{{l/(j))logn) rounds 
with high probability (cf. Theorem 14. ip . 

Note that any protocol needs at least il(logn) rounds to spread the rumor to all nodes, 
hence the runtime O(logn) rounds in our first result is tight. For the randomness requirement, 
our first protocol needs O(lognloglogn) random bits, which improves the previous best bound 
of 0(log n) random bits in total [24] . Since for any expander graph with n nodes and minimum 
degree 6 = 0(n), any protocol that finishes in O(logn) rounds with high probability needs at 
least ri(logn) random bits (cf. Theorem IC.2p . our bound is almost tight. 

The second result is for general graphs. The runtime here matches the upper bound known 
in the truly random protocol, and is tight, in the sense that there are graphs with diameter 
Q{{1 / (j)) log n) [9]. The randomness complexity of the second result improves the previous best 
one of 0{{l/(p) log n) random bits. See Table [1] for a comparison between our results and 
previous best ones. 

Techniques. Our main result is based on a generic reduction between the problem of designing 
rumor spreading protocols and the problem of constructing pseudorandom generators (PRGs) 
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Table 1: Comparison of tlie time needed to spread a rumor and the required number of random bits for various topologies. 
By A we denote the maximum degrees of G, and by </> the conductance of G. See Section 3 for the formal definition of 
conductance <h. 



for branching programs. This key insides is new in the area of studying epidemic algorithms. 
In retrospect, this connection between the two problems is natural because (1) random walks 
over branching programs resemble the rumor spreading process where nodes send messages to 
random neighbors, and (2) in a rumor spreading protocol, each node has access to only its 
own neighboring list, and is oblivious to the structure of the network. This is an analogue of 
oblivious derandomization achieved by PRGs. 

However, rumor spreading appears much more complicated than small-space computation 
due to the following facts: (1) In the rumor spreading process, rumors are "duplicated" every 
round, although every "existing" rumor viewed individually performs a random walk. Hence 
instead of considering every single random walk performed by any fixed rumor, we need to 
study the dynamics behind the whole rumor spreading process. (2) The state of the process 
at some time essentially depends on the past behavior of all nodes and is by no means com- 
putable in small space. Indeed, even knowing if a single node u gets the rumor at some round 
requires knowing the set of its neighbors having the rumor in the previous rounds, and may 
require deg(u) = Q(n) bits for dense graphs. For these reasons, this connection to small-space 
computation is delicate and not obvious. 

Surprisingly, we show that a reduction exists from the problem of designing randomness- 
efficient rumor spreading protocols to the problem of constructing PRGs for small-space compu- 
tation. Hence the question of designing randomness-efficient rumor spreading protocols is now 
exposed to the numerous techniques used in the study of constructing PRGs for small-space 
computation. In particular, the explicit constructions (resp. existence) of PRGs fooling certain 
branching programs imply the explicit constructions (resp. the existence) of rumor spreading 
protocols, and the explicit constructions of PRGs with optimal parameters imply randomness- 
optimal rumor spreading protocols for dense graphs. 

For general graphs, our present a protocol that uses independent seeds of length 0(log log n+ 
log A) in each round, where A is the maximum degree of the graph. The key idea is that the 
rumor spreading process enjoys nice locality when the maximal degree is small. Our protocol 
take advantage of this feature by using an class of objects called unbalanced expanders, which are 
then composed with pairwise independent generators. This technique yields much smaller seed 
length than using pairwise independent generators alone. This protocol has the advantage of 
being very simple. Furthermore, a simple variant of this protocol using PRGs for combinatorial 
rectangles achieves the tight runtime for strong expanders. 



Related Work. There is a large amount of literature devoted to various aspects of rumor 
spreading. The majority of research studies the rumor spreading time in terms of the graph 



properties, such as conductance 2l|], vertex expansion 2^], mixing time [5|, diameter 18|] and 
degree [18|. For instance, the first expUcit connection between randomized rumor spreading 
and graph expansion was estabhshed by Mosk-Aoyama and Shah [3l|, who proved that on any 
regular graph with conductance (/>, the protocol finishes in 0{{l/(p) ■ logn) rounds. 

The study of randomness complexity of rumor spreading protocols was started by Doerr 



et al. 13[], who proposed a quasi-random version of the above-mentioned push protocol. This 
quasi-random protocol is as follows: Every node u has a (cyclic) list of its neighbors. Once 
node u is informed, it starts at a random position of the list, but from then on node u informs 
its neighbors in the order of the list. In contrast to 0(n log n) random bits that used in 
the standard push model, they show that, by using O(nlogn) random bits, the quasi-random 
protocol is as fast as the truly random protocol for many graph topologies, e.g. complete 
graphs [2, UM, random graphs, hypercubes or expanders [ij]. It is known that for this quasi- 
random model one cannot further reduce this amount without a severe loss of efficiency [12]. 
Giakkoupis and Woelfel 23|] derived a protocol, which used 0(n log logn) random bits in total 



and finishes in O(logn) rounds on complete graphs of n nodes. Recently, Giakkoupis et al 



24l | presented two protocols. In contrast to previous work, the protocols in [2j| use only a 
poly-logarithmic number of random bits, and are as fast as the truly random protocol. Besides 
the rumor spreading problem, researchers also studied the question of designing randomness- 
efficient or deterministic protocols for similar problems. For instance, Haeupler |23] presented 
one deterministic gossip algorithm for the local broadcast problem. 

Throughout the paper we assume that nodes have no initial IDs, and we combine the pro- 
tocols with an ID distribution mechanism so that every node gets a unique ID once it gets the 
rumor. Moreover, we assume the standard adversary model, which was also used in 13l.l23l.l24|: 
In each round, every informed node u chooses an index j G {1, . . . , deg(tt)}, and sends a rumor 
to the jth node in its adjacency list. No edge connection information is available to u other 
than its adjacency list; and the order of u's neighbors in this list is determined by an oblivious 
adversary (before the algorithm is executed). 

Organization. The paper is structured as follows: We review some basic notations and tools 
that we will use in Section [2l Section [3] presents the protocols for dense expander graphs, and 
the reductions between the problem of rumor spreading and the problem of constructing PRGs 
for branching programs. In Section|3]we construct one protocol for general graphs. The protocol 
for strong expander graphs will be discussed in Section [5l 

2 Preliminaries 

Let G = (y, E) be a connected, and undirected graph with n nodes. The maximum, minimum, 
and average degree of G are represented by A, 5, and d. For any node u, the degree of u is 
represented by deg(u), and the set of neighbors of u is represented by N{u). Moreover, for any 
set S* C y, the neighboring set of S is defined by N{S) = UMe5-^(^)i ^^^ the volume of S is 
defined by vol(5) = X^ues deg(ii). For any set S,T C V, we define E{S,T) = {{u,v} : u £ 
S and v£T} and e{S,T)^\E{S,T)\. 

By log X we denote the binary logarithm of x. For any integer m, define [m] = {0, . . . ,m — l}. 
The product distribution of two distributions X and Y is denoted hj X xY. The disjoint union 
of a family of sets {Ai : i £ 1} indexed by I is denoted by |Jie/^* ~ Uje/{(^>^) • ^ ^ ^i}- 
With high probability stands for with probability 1 — o(l). 

We introduce the main tools that we use in our paper. 



2.1 Pseudorandom Generators 

One key tool used in our construction is pseudorandom generators. Informally, pseudoran- 
dom generators are deterministic algorithms / such that, for a given random string x chosen 
uniformly at random from set {0, 1}^, / produces a string f{x) € {0, l}'^^^\n{i) ^ £, which is 
indistinguishable from the uniform distribution over {0, 1}"(^' by a certain family C of functions. 

Pairwise Independent Generators. 

Definition 2.1 (Pairwise Independent Generator). We say Xq,. . . ,Xd-i with Xi distributed 
over [mi\ are e-pairwise independent if 

Pr [Xi = x] — :^ ^ e for all i G [d] and x € [rrii], and 

Pr [Xi = X A Xj = x'] — ^,^^. ^ e for all distinct i,j^ [d\ and all x G [rrij] , x' S [ruj] . 

We say they are pairwise independent if e = 0. We say Q : {0, 1}^ — t- [mg] x • • • x [nid^i] is an 
(e-)pairwise independent generator if its outputs are (e-)pairwise independent given a uniformly 
distributed seed. 

Lemma 2.2. Suppose Q = {Qq,. . . ^Qd-i) is a pairwise independent generator where Qi : 
{0,1}^ — )■ [m]. Define Q' = (^Q' • • ■ '^d-i) '^here Q'i{x) = Qi{x) vnodrui fori G [d]. Then 
Q' : {0, lY — ^ [?7io] X • • • X [rUfi-i] is an e-pairwise independent generator where e = 2/m. 

Proof. For distinct i,j G [d] and x G [mi], x' G [mj], let B (resp. B') be the preimages of 
X (resp. x') under the map s i— )• s mod rrii (resp. s i— )• s mod mj). Then ||i?| — m/mi[ ^ 1 
and ll-B'l — m/mj[ ^ 1. So Frs[Q'-{s) = x] = [B[/m which differs from 1/m-j by at most 1/m. 
Similarly Prs[Gi{s) = x A G'j{s) = x'] = [B[[B'[/m'^ which differs from l/{mimj) by at most 
2/m. ■ 

PRGs for Combinatorial Rectangles. Given d G N and finite set S = 5*0 x • • • x Sd~i, let 
Ms,d = {Aox...x Ad-i : ^0 ^ 5o, • • • , ^d-i ^ Sd-i} = H ^(^0, 

ie[d] 

where V{Si) is the power set of set Si. The members of Ms,d a-^^^ called (5, (i)-combinatorial 
rectangles. 

Definition 2.3 (PRGs for Combinatorial Rectangles). Given e > 0, d G N and finite set 
S = Sq X ■ ■ ■ X Sd-i, suppose a function f : {0, 1}^ — )• 5o x • • • x Sd-i satisfies the following 
property 



Pr^^eio,!}^ [fix) eA]- Pr^^Sox-xS.^i [x £ A' 



^ e 



for all A G A^5,d, then we say that f is a PRG that e-fools A4s,d with seed length i. 

Let S = [m] . By probabilistic methods there exists a PRG e-fooling Ais,d with seed length 
0(log m + log d + log(l/e)). The problem of explicitly constructing PRGs matching this bound 
is currently open. There is a long line of research devoting to this problem [j, llTI . l25l . |30|]. The 



current state of the art is given by [25[, which explicitly constructs PRGs e-fooling A4s,d with 
seed length 0(logm -|- logd) -|- 0(log(l/e)). 

Theorem 2.4 ([25]). Let S = [mY. There exists an explicit PRG Q that e-fools M.s,d with seed 
length 0(log m -|- log d -\- log(l/£) log log(l/e) log log log(l/e)) . [J 



^In [25|] the seed length is presented as 0((loglogm,)(logm + logd + log(l/e))) + 0(log(l/e)). But there are 
techniques of reducing m and d to m' = {1 / e)'~"'^\ d! = (l/e)"^^' using 0(logm + logd) randomness, cf. [a. l30| . 



Note that the seed length of the above PRG is 0(logr7T,) if d is polynomial in m and 1/e is 
slightly sub-polynomial in m. 

We then prove an analogue of Lemma 12.21 

Lemma 2.5. Suppose Q = {Go^ ■ ■ ■ ^Gd-i) is a PRG e-fooling A4s,d where S = [m]'^. Define 
Q' = {Q'q-, • • • , Q'd-i) where G'i{x) = Qi{x) mod rrij for i £ [d]. Then Q' is a PRG e' -fooling Ms',d, 
where S' = [mo] x • • • x [m^-i]; o,nd e' = e + (tuq + • • • + 7Ti^„i)/m. 

Proof. For AqX ■ ■ ■ x A^-i G Ms',d^ let Bi be the preimages of Ai under the map s i— )• s mod mi 
for all i. Then 5o x • • • x Bd-i G Ms4- Then Pr, [Aiefdl ^K*) ^ A^ = Pr, \^i^^d^ Gi{s) G B., 



which differs from rTjefdl (1-^*1/"^) ^^ ^^ most e since ^ is a pseudorandom generator e-fooling 
Ms,d- Note that \\Bi\ — \Ai\ ■ m/mi\ ^ \Ai\ ^ ttt-j for all i. A simple induction shows that 
Ilie[d]i\Bi\/'^) differs from Ui(^[d]i\M / m^) by at most 

y^ \\Bi\/m- \Ai\/mi\ ^ (mo H hmrf„i)/m. ■ 

ield\ 

PRGs for Branching Programs 

Definition 2.6 (Branching Programs). A branching program of length L, width W and degree 
D, or an (L, W, D)-branching program, is a directed (multi)-graph with node set [W] x{0, . . . ,L}. 
We say the nodes in [W] x{i} are on the ith layer for ^ i ^ L. Each node {u,i) except those on 
the last layer has D outgoing edges to nodes on the next layer, and these D edges are associated 
with D distinct labels from [D] . 

Let ;B be a branching program of length L, width W and degree D. For x = (xi, . . . , xl) G 
[D] and a node (s,0) on the first layer, define B{s,x) G [n] such that the random walk that 
starts from (s, 0) and takes the edge with label Xi at the ith step for 1 ^ i ^ L finally arrives 
at {B{s,x), L). 

Definition 2.7 (PRGs for branching programs). A function f : {0, 1} — t- [D] is a PRG that 
e-fools (L, W., D)-branching programs if for any branching program, B of length L, width W and 
degree D, any node (s, 0) on the first layer, and any {t, L) on the last layer, it holds that 



Pr^e{o,i}^ [ ^(^' fix))=t]- Pr^epji [ B{s, x) = t] 



^e- 



Many attempts were made to construct explicit PRGs for branching programs [29l. |32|. I3c 
due to its connection with the RL vs. L problem. The probabilistic method guarantees the 
existence of a (non-explicit) PRG that e-fools (L, VF, D)-branching programs with seed length 
0(log L + log W + log D + log(l/g)). The problem of explicitly constructing such PRGs remains 



a great open problem. See [29|, |33, l33| for known explicit constructions. We will use the 
constructions in |29, \32 



Theorem 2.8 (|29l. 1321]). There exists an explicit PRG f that e-fools {L,W, D) -branching pro- 
grams with seed length 0(log L(log W + log D + log L + log(l/e))) . 

Combining diflferent generators. Finally, we show the properties of the combination of 
different PRGs. 

Lemma 2.9. There exists an explicit function Q : {0, 1}^ — )• [m]'^ that is both a pairwise 
independent generator and a PRG e-fooling A^[m]d,d '"^^^^ ^^^d length 

0(log m + log d + log(l/e) log log(l/e) log log log(l/e)). 



Indeed, suppose Q = {Qq, . . . ,Q\_i) is a pairwise independent generator with seed length i\ = 
0(logm + logd), where Q\ : {0, l}^i -^ [m] for i e [d]. Suppose G^ = {G^ ..., gj_-^) is a PRG 
e-fooling M.\^T.d,d '^^^^ ^^^d length £2 = 0(logm + logd + log(l/e) loglog(l/e) logloglog(l/e)). 
Identify [m] with Z„ and define G = {Go, • • . ,Gd~i) ■ {0, if^ x {0, if^ -^ [mf by 

Giixi,X2) = Gt{xi) + G!{x2) 

for i £ [d], where the additions are performed over Z^- Then G is both a pairwise independent 
generator and a PRG e-fooling TWr^id ^. 

Proof. Fix xi G {0,1}^ and consider X2 uniformly distributed over {0,1}^. Let j4o x • • • x 
Ad-i S -^fml<*,d be a combinatorial rectangle. For i G [d], let A'^^ = {y — G^ixi) : y G Ai} be the 
set obtained by shifting Ai by (— ^^^(xi)) in [m] = Z^. Hence the probability that G{xi,X2) is 
in a combinatorial rectangle Aq x ■ ■ ■ x A^-i G -A4[m]'*,d is just the probability that GH^2) is in 
{A'qX---x A'^_^) E M[m]d,d- This probability differs from Uield]iK\/m) = Ui&[d]i\Ai\/m) by 
at most e since G'^ e-fools M-i^^jd^^. So G e- fools M[m]<i,d ^^ well. A similar argument shows that 
^ is a pairwise independent generator. ■ 

2.2 Unbalanced Expanders with Near-Optimal Expansion 

We consider the following kind of left-regular bipartite graphs. 

Definition 2.10. Let T : [N] x [D] — > UiefDlI^^j] ^^ '^ function where T{x,y) G [My] for any 
X G [N], y G [D]. Function F specifies a left-degree D bipartite graph with left vertex set [N] 
and right vertex set UiefDlI^*] ^'^ ^^^ following way: for x € [A^] and y € [D], the yth neighbor 
of X is given by V{x,y). 

We are interested in graphs F exhibiting excellent expansion properties. This leads to the 



notion of unbalanced expanders [26|, |34 1 



Definition 2.11 (Unbalanced expanders fi^, [11]). Let F : [A^] x [D] -^ Ui6[D][^^i] ^^ as in 
Definition \2.1(A We call F a [K, A) -expander if for any set S C [N] of size K, it holds that 
\N{S)\ ^ AK. We call F a {s^K, A)-expander if it is a (AT', A)-expander for all AT' ^ AT. 

In particular we are interested in (AT, A)-expanders, where the parameter A = {1 — e)D 
for small e, i.e. for any subset S of size K from the left set [A], there is almost no collision 
among the neighbors of nodes in S. Explicit constructions of such unbalanced expanders with 
near-optimal expansion are known. 

Theorem 2.12 (J26|). For any N G N, K ^ N, and e > 0, there is an explicit (A, (1 — 
e)D)-expander F : [A] x [D] -^ U^^^D]Wi\ ™^^ ^ = (^J and Mq = ■ ■ ■ = Md-i ^ 

max{A»,A:'^(i)}. 

Assume that F : [A] x [D] — ^ UiefDlt^*] i^ ^ (-^j (1 ~ e)A>)-expander. We consider the 
map F(-, [/) applied on any K elements of [A^] where U is uniformly distributed over [D]. The 
following lemma states that with high probability these A elements are mapped into UiefDlt^*] 
with almost no collision. 

Lemma 2.13. Let F : [A] x [D] -^ \_\i^[D][^i\ ^^ ^ i^^ (1 ~ e)D)- expander. Let S be a subset 
of [A] of size K. Then for at least (1 — ^/e)-fraction of y £ [D], it holds that \{T{x,y) : x € 
S}\ ^ (1 - ^)K. 



^The definition here is slightly different from [2y, [SJ] as we require r{x,y) £ [My]. This is analogous to the 
difference between standard and strong condensers. 



Proof. The size of N{S) = Uye[D]{^ix^ y) : x e S} is at least {l-e)DK as T is a {K, {l-e)D)- 
expander. So Ej^ [|{r(x,y) : x G S}\] ^ (1 — e)K with y uniformly distributed over [D]. Also 
note that \{T{x,y) : x G 5}| ^ |5| = K for any y G [D]. Applying Markov's inequality on 
K-\{T{x,y) : x G 5}|, we have Prj,[|{r(x,y) : x G S}| < (1 - ^)/i:] ^ V^. ■ 

3 Protocol for Expander Graphs 

Let G = (y, E) be an undirected and simple graph with V[G\ = [n]. We consider only T'-round 
protocols for G, in which nodes send rumors only for the first T' rounds, and assume that 
T' = (log?7,)'^(^' is an upper bound of the rumor spreading time. We combine the protocols 
with an ID distribution mechanism so that every node gets a unique ID once it gets the rumor. 
Specially, in round there is one arbitrary node having the rumor, and the ID of this node 
is set to be 0. We assume that node knows T', n, and the maximum degree A. Moveover, 
node chooses a binary string, called seed, uniformly at random, and the seed is appended 
to the rumor. In subsequent rounds, whenever one node with ID u sends the rumor to one 
of its neighbors in round t, it also sends a unique string consisting of the ID u, current round 
number i, as well as parameter T', n and A (Our protocols work fine as long as the initial node 
having the rumor knows some upper bound n' of n, where n' is a polynomial of n). A node is 
uninformed as long as it has not received a rumor. Once a node receives the first rumor from 
an informed node with ID u in round t, it becomes informed and gets a unique ID defined by 
gt{v) = 2*~^ + u. If one node becomes informed from multiple informed nodes, then this node 
chooses an arbitrary node with ID u that informs it and uses gt{u) as its ID. 



Claim 3.1 ([2j]). Through the protocol above, all informed nodes have different IDs. Moreover, 
if the protocol finishes in T rounds, then all the IDs are in [2-^] . 

Throughout the rest of the paper we assume that all the IDs are in a set [0, n^] for a constant 
c ^ 1. Denote by s the initial node having the rumor. All the omitted proofs of the lemmas 
can be found in the appendix. 

3.1 Protocol 

Let Ac be the adjacency matrix of G, and D the nxn diagonal matrix defined by Duu = deg(n) 
for u G ^[G]. Let Mg = D~ Ac be the transition matrix for the random walk over G, and 

Define the n real eigenvalues of Nc by 1 = Ai ^ . . . ^ A„ ^ — 1. 

Definition 3.2 (Spectral Expansion). Graph G has spectral expansion a if \ ^ 1 — a, where A 
is the second largest (absolute) eigenvalue o/Ng. 

In this subsection we study protocols for graphs G with A/5 = 0(1) and spectral expansion 
a. Let f3 = A/ 5 = 0(1). The main result of Section [3] is as follows. 

Theorem 3.3. Let G be a graph of n nodes with A/5 = 0(1) and spectral expansion a 
satisfying a ^ 8A^/5^'^. Then the following statements hold: (1) There is a protocol using 
0(logn(loglogn + log(l/a))) random bits such that with high probability, all but o{\)-fraction 
of the nodes in G get informed in 0(a^^ log n) rounds. (2) Moreover, if 5 = Q{n), then there 
is an explicit protocol using 0(logn(loglogn + log(l/a))) random bits such that with high prob- 
ability all nodes in G get informed in 0{a~'^\ogn) rounds. 

For the case that a is a positive constant, i.e. G is a expander graph, we may always assume 
that a ^ 8A^(5~^'^, since otherwise A < 64/3^/a^ = 0(1) and there is a simple deterministic 
protocol that finishes in O(logn) rounds. 



Corollary 3.4. Let G he an expander graph of n nodes where A/5 = 0{\), 5 = oj{\) and 
a = 0(1). Then the following statements hold: (1) There is a protocol using O(lognloglogn) 
random bits such that with high probability, all but o{l)-fraction of the nodes in G get informed 
in O(logn) rounds. (2) If G is a complete graph of n nodes, then there is an explicit protocol 
using O(lognloglogn) random bits such that with high probability all nodes in G get informed 
in O(logn) rounds. 

This result is based on a reduction between rumor spreading protocols and PRGs for branch- 
ing programs. We remark that the randomness complexity for Theorem 13. 31 and Corollary 13.41 is 
sub-optimal only because we do not yet know good enough explicit PRGs for branching programs 
and, as stated later, any construction of the PRGs with optimal parameters implies the explicit 
construction of randomness-optimal rumor spreading protocols for dense graphs of n nodes 
that use 0(logn) random bits in total. See Theorem IC.2I for the lower bound of randomness 
requirement for graph G with 5 = Q{n). 

Our protocol is based on pairwise independent generators, and PRGs for branching programs, 
and the formal description of the protocol is as follows: 

Protocol 1 (Protocol for Expander Graphs). Let m = n^^^> be a sufficiently large power of 2. 
Pick the following objects: 

• An explicit pairwise independent generator Q = [Qq, . . . ,Qn^i) : {0,1}^ — )• [m]", where 
£ = 0(logn). 

• An explicit PRG / = (/i, . . . , /t) : {0, 1}^' -^ ({0, 1}^)^ that e-fools (T, n^, 2^^) -branching 
programs with e = n~^ and seed length I' = O(logrelogT). 

These two objects Q and f can be uniquely constructed from n'^, and hence are known to every 
informed node. 

The initial node having the rumor chooses a random string x G {0, 1} . This random string 
is appended with the rumor and sent to other nodes. Once one node gets the rumor, it gets 
the ID u. In the ith round, 1 ^ i ^ T, node u sends the rumor to the neighbor with index 
Gu{fi{x)) mod deg(M) in its adjacency list. 

Proposition 3.5. Assume that Protocol [I\ finishes in T rounds. Then it uses O(lognlogr) 
random bits in total. 

Remark 3.6. One difference between Protocol[I\ and the protocols in [2J,J is that in Protocol[I\ 
different rounds use one common seed x, while in l24] the seeds used by different rounds are 
different and mutually independent, and hence the analysis is much easier. 

3.2 Analysis 

We introduce the notation used in the analysis. For m G N, vector u G M"* and real number 
p ^ 1, define the norm ||u||p = (X^^^i |uj|^) '^. In particular, ||u||oo = maxi<gj<gm |ui|. The 
inner product of two vectors u, v G M™ is (u,v) = Yl^i^i^i- ^^ write Im for the vector in 
R™ having ones in all entries, or simply 1 if the dimension is clear from the context. Similarly 
write Om or for the zero vector. Let e, be the vector that has an one in the ith entry and zero 
elsewhere. Write Im or I for the m x m identity matrix. For a matrix M G M™^™ ^ we use Mjj 
to denote the entry on M's ith row and jth column. For p G [1, oo) U {oo}, define 

||M||p = sup " I . 

It is easy to show that ||M||i equals the maximum of the ^i-norms of the rows of M. And 
1 1 M I loo equals the maximum of the £i -norms of the columns of M, or equivalently ||MT^||i. We 
say a square matrix M is stochastic if all of its entries are non-negative and all of its rows have 
£i-norm 1. Clearly if M is stochastic, then ||M||i = 1. 



Lemma 3.7. 



|M||i^ 



|M||i||M| 



Define linear operators 71,72 : IK" — > M" such that, for a vector u G M" indexed by 
{u,v) G y[G] X V[G], we have (Ti(u))„ = Y.veV[G] ^{u,v) and (7i(u))„ = Y.veV[G] ^{v,u)- When 
u represents a probabihty distribution over V[G] xV[G], the vectors 71 (u) and 72(u) represents 
its two marginal distributions. 

3.2.1 Proof Sketch 

Now we discuss the intuitions behind constructing the protocol and sketch our proof. A main 
technique that we use here is a generic reduction between rumor spreading processes and branch- 
ing programs. More specifically, we compare the process of rumor spreading with a random walk 
on a branching program. For random walks, a walk always stays at a single node throughout 
the process, although this node keeps changing. On the other hand, in the process of rumor 
spreading, each informed node u randomly sends the rumor to one of its neighbors v in each 
round, and then n, v are both informed subsequently. So we may think of rumor spreading as 
many random walks in parallel: When node u sends the rumor to v, one random walk moves 
from n to f whereas another one stays at u. In order to characterize this behavior precisely, we 
introduce the notion of colored random walks. 

Definition 3.8 (Colored Random Walk). Given a random rumor spreading process on a graph 
G with the initial node s, an associated colored random walk of length T is a sequence ofT+1 
nodes (po, . . . ,pt) of G together with a lazy/non-lazy coloring on the pairs {pi,pi^i) called 
colored edges for ^ i <T, such that po = s, and 

1. if {pi,pi+i) is lazy, then pi = Pi+i; 

■2- if (PiiPi+i) i^ non-lazy, then pi picks Pi+i in round i -\- 1 in the rumor spreading process. 

Node s itself is a colored random walk of length 0. 

We denote by Ct — {lazy, non-lazy}^ the set of possible color patterns for length-T colored 
paths. For S £ Ct, there exists a unique colored random walk of length T with color pattern 
S associated with a given instantiation of a rumor spreading process. Let X^ be the indicator 
random variable of the event that the colored random walk with color pattern S finally reaches 
node u. Note that u receives the rumor in T rounds if some colored random walk of length T 
arrives at u. Therefore 



Pr [ u receives the rumor in T rounds ] = Pr 



S(^Ct 



(3.1) 



We want to reduce the global event "^g^^ -^u > to local events X^ > 0. Note that the 
indicator of J2seC -^u > is the conjunction of those of X^ > 0. We use the Cauchy-Schwarz 
inequality to "linearize" the conjunction operation and then use the linearity of expectations: 



Pr 



S&Ct 



E 



^Esec-^ XS>0 



E 



> 



l2seCT ^« 



E 



zZseCr -^ii- 



I^s,5'eCT^[^« ]^ 



^u 



j:s,s'eCr^[XSXS"_ 



(3.2) 



Note that each E [ X^ ] only depends on a single random walk with color pattern S, and hence 
can be well approximated by the PRGs for branching programs. The same is true for each 
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E X^X^ if we use branching programs with set of states V[G] x V[G] to keep track of two 
colored random walks simultaneously. 

If colored random walks were pairwise independent, then E X^X^ = E [^^] E X^ 

for S ^ S' and the lower bound given by (|3.2p would be very close to 1. Unfortunately they 
are not pairwise independent: if two colored paths are at the same node in some round and 
their next steps are both non-lazy, they will move to the same random neighbor rather than 
two independent random neighbors. Consequently, for a typical pair of colored random walks 



{S,S') we only have E 



where c > 1 is some constant. Then 



x^xi'l ^c-b[x^]b[x^ 

the probability lower bound given by ()3.2p is no better than 1/c. To remedy this problem, we 
observe that (j3.2p still holds if we replace Ct by any subset C'j, '^ Ct, and (jS.ip can be rewritten 
as 



Pr [ u receives the rumor in T rounds ] ^ Pr 



E ^« > 



S^C'j, 



for any C^ C Ct. This leads to the following lemma. 

Lemma 3.9. In the random rumor spreading process, the probability that node u £ y[G] receives 
the rumor in T rounds is lower bounded by 

X., 



E 



S,S'&C' 



E[Xf]E 



Es,5'ec^E[X„^Xf] ' 

where C^ is an arbitrary subset of Ct- 

By a careful choice of C'j, along with a delicate analysis we can show that two typical colored 
random paths are "mildly" pairwise independent, yielding a 1 — o(l) lower bound in Lemma [3.91 
Hence the construction of PRGs for branching programs implies an explicit protocol for rumor 
spreading. 

Another important issue is that random walks on branching programs only take one step 
each time, whereas in the rumor spreading process multiple steps are took simultaneously, each 
requiring some randomness. We use a pairwise independent generator for the choices of these 
steps. This is perfectly compatible with the discussion above as we only use expectations of 
monomials X^ and X^X^ , neither involving more than two variables and hence pairwise inde- 
pendence suffices. Formally the following theorem shows a reduction from PRGs for branching 
programs to rumor spreading protocols. 

Theorem 3.10. Suppose f is an explicit PRG that e-fools {L,W,D) -branching programs with 



seed length £, where L = Q{a logn), D 



n 



6(1) 



sufficiently large, e 



n 



and W 



n 



Then there exists an explicit rumor spreading protocol using £ random bits such that, for any 
graph G with n nodes, IS./ 5 = 0(1) and spectral expansion a ^ 8A^5~^-^, all but o(l) fraction 
of the nodes in G get informed in 0{a^^ logn) rounds with high probability. 

By Theorem 13. lot an explicit PRG construction with good enough parameters, in particular 
with seed length £ = O(logn) and error e = n~^, yields explicit rumor spreading protocols using 
only O(logn) random bits. In addition, we usually only need to fool short branching programs 
instead of those in full generality (i.e. of length n®^^)) since the process of rumor spreading 
typically finishes in a small number of rounds. The PRG in [33| fools such branching programs 
using only O(logn) random bits, but the error obtained is too large for our application, being 
2" log n £qj, g^j^y c o u s t a ut u > instead of n~^^^. We will use the PRG constructions from 



or 



321 ] instead for our protocol. 

Theorem 3.11 ([29|, Is^]). There exists an explicit PRG f that e-fools {L,W,D)- branching 
programs with seed length O(lognlogL) for W = n^^\D = n^^', and 1/e = n^^' . 

The first statement of Theorem 13.31 follow from Theorem 13.101 and Theorem 13.111 
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3.2.2 Proof of Theorem [HHUl 

To prove Theorem 13.101 we first generalize Protocol [1] to a family of (not necessarily explicit) 
protocols. That is, instead of studying Protocol [1] where every informed node chooses neighbors 
according to the output of the PRGs, we consider a family of protocols, and each of them is 
specified by a joint distribution of random variables V = (Xi, . . . ,Xt)-, such that the random 
choices in round i are determined by the ith variable Xi with a pairwise independent generator. 

Protocol 2 (Protocol via distribution V). Let m = n^^' be a sufficiently large power of 2. 
Let Q = {Qq, . . . ,Qn-i) '■ {0,1}^ —7- [m]"' be an explicit pairwise independent generator, where 
£ = O(logn). A distribution V over ({0, 1} ) specifies the following protocol: 

The initial node having the rumor chooses a random string (xi,...,xt) G ({0,1}^) ac- 
cording to the distribution V . This random, string is appended with the rumor and sent to other 
nodes. Once one node gets the rumor, it gets the LD u. In the ith round, node u chooses the 
neighbor with index Gui-Ci) mod deg(u) in its adjacency list to send the rumor. 

Setting V as the uniform distribution, denoted by U, yields the hashing-based protocol in 



24l |. We recover Protocol [T] by letting V be the output of a PRG for branching programs. 

We wish to bound E [ X^ ] and E X^X^ and then apply Lemma 13.91 For the family 
of protocols above, these quantities can be easily characterized using branching programs. For 
color patterns S, S' £ Ct, define the branching program I3s,s' of length T, width n^ and degree 
2^ as follows: For {u,v) e V[G] x V[G], i G [T] and x E {0,'l}^ the edge of {{u,v),i) labeled x 
goes to {{w, y),i + 1) where w and y are picked in the following way: If the ith color of S (resp. 
S') is lazy, then let w = u (resp. y = v), otherwise let w (resp. y) be the zth neighbor of u 
(resp. v), where z = Guix) mod deg(u) (resp. Gv{x) mod deg(t;)). The following lemma follows 
immediately from this construction: 

Lemma 3.12. For Protocol that is specified by a distribution V over ({0, 1}^) and any 
u,v G V[G], S,S' G Ct, it holds that 



E 

and 



^0, ^0, 



Pr^^P [Bs,s'{{s,s),x) = {u,v) 



E[^f]= Yl Fr,x^v[l3s,S'{{s,s),x) = {u,v)]= Y, Pr^r^v [l3s',s{{s,s),x) = {v,u)] . 

vGV[G] v£V[G] 

Recall that Mg = D~ A^ is the transition matrix for the random walk over G. We define by 

2 2 

P G M" ^" is the transition matrix with its rows and columns indexed by the set V[G] x ^[G], 
which corresponds to two non-lazy steps in parallel: 






{Mg0Mg)(u,v){w,x) u^v, 

(Mg)(„,^) u = v,w = x, 

u = v,w ^ x. 



Remark 3.13. MatrixP agrees with Mc (g) Mfj except on the rows indexed by {u,u), u G ^[G]. 
This is a manifestation of the fact that the "non-lazy" steps from the same node made by two 
different colored random walks are not independent, i.e., every informed node can only send the 
rumor to one neighbor in each round. 

To study random walks associated with certain color patterns S = {Si, . . . , St) G Ct, S" = 
{S'l, . . . , S'rp) G Ct, we introduce the matrix Jgg, G M" ^^ defined by 

T 

A TT T» 



T* A TT -iW 



i=l 



12 



where each matrix J^ 5/ is defined by 



'S,S' 



I (g) I Si = lazy, S[ = lazy 

I (g) Mfj Si = lazy, 5^ = non-lazy 

Mg (8> I 54 = non-lazy, 5^' = lazy 

P Si = non-lazy, 5' = non-lazy. 



By definition, the stochastic matrix J^ g, corresponds to two colored random walks in parallel 
over the graph G, with the color patterns S and S" respectively. 

The following lemma shows that the computation over the branching program Bs,s' can be 
characterized by the matrix J*g gr, up to a negligible error term. 

Lemma 3.14. Let U he the uniform distribution over ({0,1}) . Then for any u,v £ V[G], 
S, S' G Ct, it holds that 

\Pr^r^u [Bs,s'{{s,s),x) = {u,v)] - (e(^^^)J5^5.,,e(„_^))| ^ 2Tn^/m. 

{ ■\ 2 2 

Proof. Let B^*' G M" ^" be the transition matrix of the branching program Bs,s' from the 
(i — l)th to ith level. Then we have 



T 



Pr^^u [Bs,s'i{s,s),x) = {u,v)] 



-(s.s) 



Rb 



(i) 



1 (M,t)) 



4 = 1 



So it suffices to show that 



T 



T 



nB^-n-" 



(0 

S,S' 



i=l 



i=l 



^ 2Tn^/m. 



By the construction oiBs^s'-, ^ random walk at {{ui,vi),i — l) goes to ((^2,^2);^) where U2 (resp. 
V2) are picked as Gui{x) mod deg(Mi) (resp. ^,;i(x) mod deg(ui)) for random x (or U2 = ui (resp. 
1^2 = ^1) if the corresponding step is lazy). Lemma F2.2l then implies that each entry B^ w ^ 



differs from ( J 



^ g g, ) by at most 2/m, and hence 

' ' {ui,Vi){u2,V2) 



b(*) - J 



S,S' 



^ 2n /m 



for all 1 ^ i ^ T. A simple induction on T then shows 

T T 



nBO-n-" 



(0 

S,S' 



j=l 



i=l 



^ 2Tn^/m. 



Lemma 13.121 and Lemma 13.141 show that to bound E X^X^ and E [ X^ ] , it suffices to 

study the stochastic matrices J^ g, . Instead of analyzing J^ ^5/ for different S, S' individually, 
we investigate them simultaneously for S, S' ranged over some set C^ C Ct defined as follows: 
For /c = 0, 1, . . . , r/3 (for simplicity assume T is a multiple of 3), define 

^Tk — {('5'i) • • • ) St) £ Ct '■ Si = lazy for z ^ A; or i > 2T/3 + k} 

and let C^ = IJi=o ^T i ■ Here C^ ^ consists of color patterns for colored random walks whose 
first k steps and last T/3 — k steps are lazy. 
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Let TV = lD/||lD||i, and it is easy to see that tt is the stationary distribution of Mg-. Define 
vectors x G M" and y € M" such that 

X(n,i,) = ^S,S'&C!j. [ {^{s,s)^'s,S'^^{u,v)) ] , Y = T^x). (3.3) 

Then the following key lemma states when the length of colored random walks is sufficiently 
large, x is close to the stationary distribution tt ® tt of M^ ^ Mg, and similarly y is close to 
the stationary distribution tv of M^. This implies that after sufficient long rounds, the prob- 
ability that colored random walks hit arbitrary nodes can be approximated by the stationary 
distribution of the matrix Mc, whose definition does not involve colored random walks. 

Lemma 3.15. ||x — tt i^) 7r||2 = o{n~'^''^) and ||y — 7r||2 = 0{n^^) for sufficiently large T = 
G(a~^logn). 

Lemma 13.151 is derived from a series of technical lemmas listed Section [3.2.31 below, whose 
proofs are deferred to the appendix. Now we are ready to prove Theorem 13.101 

Proof of Theorem \3.1(A By Lemma 13.91 and Lemma 13.121 we have 
Pr [ u receives the rumor in T rounds ] 



> 



Define x^ G M"' and y^ G M" such that 

xf„_„) = Bs,s'ec'^ [Pr^^p [Bs,s'{{s,s),x) = {u,v)]] , y^ = ri(x'), 

where V is the output of the PRG in Protocol [TJ We define x" G M" and y" G M" in the same 
way except that V is replaced by U. 

As the distribution V is the output of the PRG, and hence e- fools all branching programs 
Bs^S') we have llx^ — x^ll ^ e = n~^ and consequently lly^ ~ 5^11 ^ ris = n~^. 

By Lemma 13.141 we have ||x — x|| , ||y^ — y|| = 2Tn'^/m = 0{n~'^) as m = n '^^ is 
sufficiently large. 

By Lemma 13.151 ||y — t^\\^ ^ ||y — 7r||2 = 0{n~'^). Therefore ||y — 7r|| = 0(n^^). Note 
that TTu = deg(n)/X]„gy[g] deg('u) ^ (3n~^ . Therefore y^/7r„ ^ 1 - 0{n-^). 

By Lemma 13.151 ||x — vr (S" vrUg = o(n^^'^). Define vector w G M" such that "W(u,j;) = 1 if 
u = V and W(„,;) = otherwise. Then ||w||2 = n^'"^ . By the Cauchy-Schwarz inequality, we 
have 

E„ey[G] [|x(n,«) -{t^®t^)(u,u)\] ^ n"^||x-7r®7r||2||w||2 = oin'"^). 

By Markov's inequality, for all but o(l)-fraction of u G ^[G], we have 

|x(„,«) -{■K® '^){u,u)\ = oin-"^) 

and hence x^^^^ - (tt (g) 7r)(„ ,,) = o{rr'^). For such u we have x^^^^/(7r ® 7r)(„ „) ^ 1 + o(l) 

since (tt t^){u,u) ^ /3~^n~^. Therefore, for all but o(l)-fraction of nodes u G ^[G], the 
probability that u gets informed is at least 

(yn)' > (i-o(i))K)^ ^ 1 _ .^^ 

^M " (l + o(l))(7r0 7r)(,,,) 
By Markov's inequality, with probability 1 — o(l), all but o(l)-fraction of the nodes get informed. 
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3.2.3 Proof of Lemma [HH^ 

Let 1 = Ai ^ . . . ^ A„ ^ 1 be the n eigenvalues of the matrix Nq = D ' M^jD" ' , associated 
with n normalized orthogonal eigenvectors vi, . . . , v„ in M". By the hypothesis of the spectral 
expansion of G, we know max{|A2|, |A„|} ^ 1 — a. 

Note that ||Mg||i = 1 and ||Mg||oo = HD^^AcHoo ^ /3||A"1Ag||oo ^ /?• By Lemma ISTl we 
have ||Mg||2 ^ \/?- Similarly ||Mg (8) Mg||i = 1, ||Mg Mg||oo ^ /3^, and ||Mg (8) Mclb ^ P- 

Define the matrix Q G M" ^" by 

A I(8I + I(8Mg + Mg(8)I + P 
^~ 4 

Note that matrix Q corresponds to two steps in parallel, each independently chosen as a lazy 
or non-lazy step with equal probability. 

For a vector u G M" , we write u = u" + u as the (non-orthogonal) decomposition of u 
such that ull || tt (g) tt and u-*- _L 1. The decomposition exists and is unique. Indeed, we have 
the following simple lemma: 

Lemma 3.16. For any vector u G W^ , let u = u" +u-^ be the (non- orthogonal) decomposition 
of u such that u" \\ n it and u _L 1. Then the following statements hold: 

(a) Write u(D D)^^/^ = ui -F U2 such that ui || 1(D ® D)^/^ and U2 -L 1(D ® D)^/2_ y/^g^ 
ull = ui(D (g) D)V2 and u^ = U2(D (g) D)V2. 

(b) ||ull||2 ^ /3||u||2 and {{u^h ^ /3||u||2. 

2 

(c) /f u G M" represents a probability distribution (i.e., all entries of u are non-negative and 
||u||i = 1), then u" = tt (g tt. 

We first look at the properties of the stochastic matrix I '^^ ^ J (g I '^^ ^ J , which acts on 

2 

the vector space R" via right multiplication. The matrix and has the following properties: 

1. Matrix (^^^^] (g (^^^) fixes the vectors parallel to tt ® tt. 

2. Matrix ( ^^ '^ 1 (g ( ~^^ '^ 1 preserves the orthogonality to 1 (In fact, this is true for any 
stochastic matrix M: (uM, 1) = (u, IM"'') = (u, 1) = for u _L 1). Moreover, it shrinks 
vectors orthogonal to 1, because of the spectral expansion of G. 

We will show that Q exhibits approximately the same properties. 

Lemma 3.17. For any vector u G M" parallel to it® tv, zi /loZds i/iai ||(uQ)-'-||2 ^ /S^n~^/^||u||2. 

Define the diagonal matrix D' G M" such that 

„/ _ \5'Du,u u = v, 

^ (u.v)(u,v) \ /_- „ _^. , 

[(D®D)(„,^)(„,^) u^v. 
Note that 52 ^ D'(„,^)(^^^) ^ (D ® D)(„,„)(,,,) ^ A^ for ah u,v e V[G]. 

2 

Lemma 3.18. For any vector u G M" orthogonal to 1, it holds that uQ _L 1 and 



uQD 



/-1/2 



^ (1 - a/8) 
2 



uD'-i/2 



Given a distribution over V[G] x V[G], its two marginal distributions converge to the sta- 
tionary distribution of Mg rapidly: 
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Lemma 3.19. Let u represent a distribution over V[G\ x 1^[G], and suppose matrix M is a 
convex combination of products of\®\, \® Mg, Mq ® I and P. Then for any k ^'H it holds 
that 

ri(uQ'=M)-7r ^(l-Q/2)V^/^ 



and 



TzCuQ'^M) -TV ^ (1 - a/2)V^/^. 



The joint distribution "almost" converges to the stationary distribution of Mc Mg as 
weh: 

Lemma 3.20. Let u represent a distribution over V[G\ x ^[G]. Then for any k £ N it holds 
that 



uQ^ -7r(g)7r 



^ 8a-i/3^n-3/2 + (i _ a/S)'^/?^. 



Lemma 3.21. Lei u represent a distribution over V[G] x V^[G]. ylnd suppose matrix M satisfies 



M 



Mg + I 



or M 



Mg + I 



/or some k' £ N. T/ien /or k £ N, it holds that 



||uQ^M - TT (g)7r||2 

^ (1 - a/2)V^/^ + (1 - a/2)''' • 8a~'^f3^n''^/'^ + (1 - a/2)'='(l - a/8)V^- 



Proof of Lemma \3.15[ For ^ A;, A;' ^ L"/3, define 



U 



fc,fc' 



E56C^fc,5'eC,' 



'' J O C'/ 

T.fe' L •^'■^ ■ 



For the case k ^ k', one can check that 



U 



k,k' 



Mg + I 



k'-k 



Q2T/3+k-k' / J 



Mg + I 



k'-k 



One way to see this is to note that 'Uk,k' corresponds to two colored random walks with color 
patterns that are randomly chosen as follows: (1) in the first k and last T/3 — k' rounds, both 
two walks only take lazy steps; (2) from {k + l)th to k'th round, the first walk takes a lazy 
or non-lazy step with equal probability in each round whereas the second walk only takes lazy 
steps; (3) from (k' + l)th to (2T/3 + A;)th round, both two walks take a lazy or non-lazy step 
with equal probability in each round; (4) from {2T/3 + k + l)th to (2T/3 + A;')th round, the 
first walk only takes lazy steps whereas the second walk takes a lazy or non-lazy step with equal 
probability in each round. 



By Lemma 13.211 with u = e(^ .,) i — ^ 



( Mg+I 



k'-k 



we know that 



eis,s)^k,k' - vr 7r||2 ^ (1 - a/2)2^/3+'=-^'/33/2 + (i _ a/2f-'^ ■ Sa" V' 

+ (1 - a/2)'='-^(l - a/8)2^/3+fc-fc'^3 
^ „-2 + (1 _ «/2)fc'-fc . 8a-i/38n-3/2, 



n 



-3/2 



(3.4) 



for sufficiently large T = Q{a ^logn). For the case k ^ k' , a symmetric argument gives the 
bound n-2 + (1 - a/2)''-''' ■ Sa-i/S^n-^/s. 
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Note that x = E5 ^/gc 



^<s,s)'is,S' 



Eo^fc,fc'^T/3 [e(3,s)Ufc,fc'], by ([33]) and the fact that 



sets C'rp 1^ have the same size for different k. Hence 



|x — TT q5 7r||2 



E, 



[< 



0^fc,fc'^T/3 Lt^(s,s)^fc,fc' 



u. 



^E, 



0^A:,fc'^T/3 



^Ui 



TT (X) TT 



TT 0$) TT 



^E, 



0s;A:,fc'^T/3 



||e(s,s)L'fc,fc' •• ^ ■• 112 
"^-2^(l_«/2)l'='-'=l.8a-V'n-3/2 
^ „-2 + 8a- V'n-3/2 . Eo^,^^/e [ (1 - a/2)* ] 
^ „-2 + 96a-2/38n-3/2r-i = o{n-'/^), 

where the second inequahty follows from ()3.4p and the last step follows from T = @{a~^ logn). 
Next we prove the second claim. Assume ^ k ^ k' ^ T/3. By Lemma 13.191 with 



k'-k 

>I\ , we have 



||ri(e(,,,)U,,,,) - vrll^ ^ (1 - a/2)2^/3+'=-^'/3V2 ^ (1 _ a/2f/'(3'/\ 

For the case k' ^ k a symmetric argument gives the same bound. 
Also note that y = Ti(x) = Eo^fc,fc'^T/3 ['7i(e(s^^)Ufe_fc') ] • Hence 

||y - 7r||2 = ||Eo^fc,fc'^T/3 ['7i(e(s,s)Ufc,fc') - tt] H^ 



^ E 



0^fc,fc'^T/3 



|'7i(e(s,s)Ufc^fc') - 7r| 



^ (1 - a/2)^/3/3i/2 = o(„-2) 



for sufficiently large T = @{a ^ logn). 
3.3 Protocol for Dense Graphs 



The second statement of Theorem 13.31 is based on Theorem 13.101 and uses another idea that 
once fl{n) nodes are informed in an expander graph with 5 = n, we can just look at the rumors 
sent by this fixed set of nodes and ignore the others without impairing the rumor spreading rate 
too much. Our construction is based on expander walks and pairwise independent generators. 
The protocol is as follows. 

Protocol 3 (Protocol for Dense Graphs). Let m = n®^^' be a sufficiently large power of 2. Pick 
the following objects: 

1. An explicit pairwise independent generator Q = {Qq^. . . ^Qn-i) ■ {0,1}^ — )• [m]", where 
£ = 0(logn). 

2. An explicit degree-D expander graph G' with spectral gap 7 and node set {0, 1} where D 
and 7 are positive constants. 

The initial node having the rumor chooses a random string x as in Protocol \^ It also picks 
random strings y G {0, 1} and z = (zi, . . . , zt) G [Df , where T = ©(a^^ • logn). Once one 
node gets the rumor, it gets the ID u. In first T rounds, each node sends the rumor using the 
random string x as in Protocol{l\ except that (x, y, z) instead of x is appended with the rumor. In 
the (T + i)th round where 1 ^ i ^ T, node u chooses a neighbor v with index Qu{y) mod deg(M) 
in its adjacency list. It then replaces y with y' , the zith neighbor of y in G' . The triple of 
random strings {x,y',z) is then appended with the rumor and sent to the node v. 

Proposition 3.22. Protocoll^ uses 0(logn(loglogn + log(l/a))) random bits in total. 
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Now we prove the second statement of Theorem 



Proof of the second statement of Theorem \3.3[ By Theorem 13.101 with high probabihty, all but 
1 — 0(1) nodes are informed after 0{pr'^ ■ logn) rounds. Condition on this event which only 
depends on x. Then y and z are still uniformly distributed. We will show that all nodes are 
informed in the next T rounds with probability at least 1 — 1/n. Let yo = y and yi be the Zjth 
neighbor of yi-i in the expander graph G' picked in Protocol[3l i.e., {yo, . . . , yr) is an expander 
walk on G' with edge labels zi, . . . , zt- Note that in the (T + i)th. round, all nodes use yi as the 
seed of the pairwise independent generator Q. 

Fix an uninformed node v. Let / be the set of neighbors of v that are informed in the 
first T rounds. Then |/| ^ deg(t') — o{n) = 0(deg(u)). Consider a random w € {0,1}^. 
For u £ I, let X^ be the boolean random variable whose value is 1 iff n informs v using 
seed w, i.e., Gu{w) mod deg(it) equals the index of v in the adjacency list of u. We have 
\B[Xu] - l/deg{u)\ ^ 2/m for any u e I and |E[X„X„/] - E[X„]E[X„/]| ^ 2/m for 
any distinct n, u' E /, by the fact that ^ is a pairwise independent generator, together with 
Lemma |2. 2 1 By Cauchy-Schwarz inequality, we have 



where 



and 



E 



Pr 






Y^X^yO 



uGl 



E 



LE„s/^^>o 



> 



E 



(Sue/ ^« 



J^E [X„] ^ j; (-^ --)= n{\I\/deg{n)) = n{i). 
^-^ ^^^Vdegu) m 



E 




uei u,u'gi 



< 






1 



f^ Vdeg(n: 



+ 



m 



+ 



E 



1 



+ 



deg(u)deg(u') m 



= 0{\I\/deg{u) + |/|V(deg(n)degK))) = 0(1). 

So Pr [X^ue/^M > 0] ^ c for some constant c > 0. Let S be the set of seeds w G {0, 1}^ such 
that there exists some node u ^ I that informs v using seed w. Then IS*! ^ c- 2 . By the hitting 
property of expander walks [1[, we have 

For sufficiently large T = 0(a~^ • logn), this probability is at most 1/n^. So for any fixed 
V G V[G] \ I, the probability that v is not informed in T rounds is at most 1/n^. The claim 
follows by the union bound. ■ 



4 Protocol for General Graphs 

We further study the protocol for general graphs. Formally let G be any graph with n nodes, 
A{G)/5{G) = 0(1) and conductance (j), where 

e{S,V\S) 



<A(G) 



mm 



5cy,o<|S|<n min{vol(S'),vol(y \ S)}' 

Theorem 4.1. Let G be any graph with A/5 = 0{1), and conductance (j). Then there is 
a protocol using 0((l/(/>) • logn • (loglogn + log A)) random bits in total, so that with high 
probability every node in G gets informed in 0{{l/(f))\ogn) rounds. 
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4.1 Protocol 



Our protocol is based on pairwise independent generators and unbalanced expanders with near- 
optimal expansion. Here different rounds use different random bits. In contrast to 0(n log n) 
random bits per round used in the truly random protocol and O(logn) random bits per round 
used in 2j], we show that O (log log n + log A) random bits per round suffice to spread the 
rumor efficiently on general graphs G with IS.{G)/6{G) = 0{\). The formal description of our 
protocol is as follows: 

Protocol 4 (Protocol for General Graphs). Let e = A^®*--*^' he sufficiently small and m = 
2riog{4/e)l _ p-^}^ ^/jg following objects: 



An explicit {K,{1 - e"^ /A) D)- expander T : [n"] x [D] -^ Uie[D][^i]' where K = 2, D 
= ... = Md-1 = M s^ D. 



1 \0{i) 

^) andMn 



An explicit pairwise independent generator Q 
i = 0(log m + log M) = 0(log log n + log A) . 



{gu...,gM) ■■ {0,1}^ ^ H*^ where 



These two objects Q and T can be uniquely constructed from n'^ and A®'^), and hence are known 
to every informed node. 

The initial node having the rumor chooses a random string (si,...,st') where every Si is 
of the form {xi,yi) G [D] x {0,1}. This random string is appended with the rumor and sent 
to other nodes. Once one node gets the rumor, it gets the ID u. In the ith round, node u 
computes r = T{u,Xi) that is in [My], the uth copy of [M]. It then chooses the neighbor with 
index Qr{yi) mod deg(n) in its adjacency list to send the rumor. 

The protocol above presents a nice "two-level hashing" framework: The first level is based 
on a pairwise independent generator Q. While the PRG-based protocol in [2J] needs to generate 
0{n) blocks and different nodes need to use different blocks, our protocol only needs M = 
(Alogn) '^^ blocks and hence 0(loglogn + log A) random bits suffice for this purpose. The 
second level uses unbalanced expanders to map node ID li S [n'^] to r G [A'^(^)] by using 
log log n + log A random bits. After these, node u uses the value of the rth block of Q to choose 
the neighbors. It is easy to see that every informed node u only needs O(polylogn) arithmetic 
operations per round in order to determine its neighbor. 



0(log log n + log A) random bits 



0(log log n + log A) 
random bits 

node ID 



u £ \rf- 



T 



index r 




th block 



(Alogn)O(i) blocks 



Figure 1: Illustration of the protocol for general graphs. Every node u uses an unbalanced expander F to generate an 
index r, and uses the rth block of PRG Q to choose a neighbor to send the rumor. 

Proposition 4.2. Assume that Protocol\^ finishes in T rounds. Then it uses 0{T ■ (loglogn + 
log A)) random bits in total. 

Remark 4.3. Using the explicit constructions of unbalanced expanders in I 2 a] and pairwise 
independent generators in 16], our protocol is very simple and can be described as follows: Let 
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¥q be a finite field of size q = {A log n)~^ ^ for a sufficiently large q. Let E(X) be an irreducible 
polynomial of degree m over Fg such that n^ ^ q"^ = n'-'^^> . We embed [n'^] into the finite field 
jpm ^ Yq[X]/{E{X)). Through this we can identify every node with ID u & [n'^] with a polynomial 
Pu of degree at most m — 1 over ¥g. The protocol then uses the random string (si, . . . , st) where 
every Si is of the form {xi,yi) = {xi,ai,bi) £ F^. Then node u computes z = ai ■ Pu{xi) + bi 
(over¥q) in the ith round, and chooses the neighbor with index {z mod deg(n)) in its adjacency 
list to send the rumor. 

4.2 Analysis 

We start by analyzing a single round t and see the properties of our protocol. Let It be the set 
of informed nodes after round t, and Ut the set of uninformed nodes after round t. Remember 
that all the random choices in round t are determined by {xt,yt)- 
We need the following lemma: 

Lemma 4.4. Fix any round ^ t < T. For any u G Ut, v G It, let X^^^ be the boolean 
random variable whose value is 1 iff v informs u in round t + 1. Then it holds that 

1. \E[X^^u ] - l/deg(f )| ^ e for any u e Ut, v £ It, 

2. Cav[Xv^u,X^'^u'] ^ e for any u,u' G Ut, v,v' £ It, {u,v) ^ {u',v'). 

Proof. For any u G Ut, v G It, suppose the index of u in the adjacency list of v is z. By 
construction, X^^u equals 1 iff ^r(i),xt)(yt) mod deg(t;) = z. Fix xt- The fact that ^ is a pairwise 
independent generator together with Lemma [2 . 2 1 shows that |E [X^_!.t(] — l/deg(v)| ^ 2/m ^ e. 

For any u,u' E Ut, v,v' S It, first assume v 7^ v' . Suppose the index of u (resp. u') in the 
adjacency list of v (resp. v') is z (resp z'). By construction, Xy^u equals 1 iff Gr{v,xt){yt) mod 
deg(t') = z, and similarly for X^/^u'- By Lemma [2.131 and the fact that F is a {K, (1 — e^/4)D)- 
expander, the event \{T{v,xt),T{v' ,xt)}\ ^ (1 — e/2) • 2 > 1 occurs with probability at least 
1 — e/2 over the choices of xj. Condition on any xt such that this event occurs. We have 
T{v, Xt) 7^ ^{v', Xt). Using the fact that Q is pairwise independent together with Lemma [T^ we 
have Gov [X^^^jX^'^u' ] ^ 2/m. For the other choices of xt, we have Gov [X^_>.„, X^/^u' ] ^ 1 
since Xv^u,Xyi^y_i are boolean. Therefore Gov [Xy^u,X,^'^u' ] ^ (1 ~ ^/2)(2/m,) + (e/2) ^ e 
for random xt. 

Now assume v = v' and hence u ^ u' . We have 

Gov [ Xy^u , Xv~^U' ] = E [ Xy^u • Xy^yl ] " E [ Xy^u ] " E [ Xy^yl ] 

= 0-E[Xy^u]-E[Xy^y,]^0. U 

Next we prove the following lemma: 

Lemma 4.5. Fix a round ^ t < T and the set It of informed nodes before round t + 1. Fix 
also an arbitrary set of edges F C E(It, Ut). Let J be the set of nodes that become informed in 
round t +1 if we consider only transmissions of the rumor along the edges in F . 

1. Pr [ J 7^ 0] ^ ci min{|F|/A, 1} for some constant ci > 0. 

2. If\F\ = 0(A) then Pr [|J| ^ C2|F|/A] ^ C3 for some constant 02,03 > 0. 

Proof. Let X^^y be the boolean random variable whose value is 1 iff i; informs u in round t + 1. 
We first prove ([TJ. Let k = \F\ and suppose F = {(t>o,no), . . . , (ffc-i, ^fc-i)}- Let X = 
Y.^e[k\ ^v,^u, ■ Then by Cauchy-Schwarz inequality, E [ lx>o ] ^ (E [ X ]) VE [ X^ ] . By LemmaSai 

E [X ] = ^ E [X,^^„, ] ^ k{l/A -e) = f](|F|/A) 

i&[k\ 
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and 

i,je[k] 

= 2^ E [Xy^^Ui ] + 2^ {^[Xv.^Ui ] E [Xy^^Uj ] + Gov [Xv-_^u,,Xy^^Uj ] ) 

ie[k] ije[fc] 

^ k{l/6 + e) + (fe2 - k){{l/6 + ef + e) = 0{\F\/A + |F| VA^) 
where we use the conditions that A/6 = 0(1) and e = A^®''^' is sufficiently small. So 
Pr [ J / 0] = E [ lx>o ] ^ (E [ X ])VE [ X2 ] = f](min{|F|/A, 1}) 

and the first statement follows. 

Next we prove the second statement. For u £ Ut, let F„ be the set of edges in F incident 
to ti, Zu be the boolean random variable whose value is 1 iff u is informed in round t + 1 via 
edges in F„, and X„ = X;(„,n)eF„ ^«^«- So Zu = lx„>o and \J\ = E«e!7t ^«- ^°^ ^ ^ ^*' 
E[Z„] = E[lx„>o] ^ (E[X„])VE [X^] = n{\Fu\/A) by a similar argument as above. So 
E[|J|] = f)(^„g^jF„|/A) = n{\F\/A). Suppose E[|J|] ^ c|F| /A for constant c> 0. 

On the other hand, for any C2 ^ 0, we have 

E[|J|] = E [1|J|^C2|F|/A • I-''!] +E [1|J|<C2|F|/A • l«^l] 

^^[hj\>C2\F\/A-\J\] +E[1|j|<,,|^|/a] ■C2\F\/A 

andhenceE[l|j|^,,l^l/A-|J|] ^ E [ | J| ] - E [ 1|j|<,,|p|/a] • C2|F|/A ^ (c - C2)|F|/A. Pick 
C2 = c/2. By Cauchy-Schwarz inequalty, we have 

p,r|,|>,|;.|/Al-Fri (E[V|^,,|^|/A-|J|])\((c-C2)|F|/A)^ 
Pr[|J| ^C2\F\/A\ -E[1|j|^,2|^|/aJ ^ Sni^lTl ^ p^ri 7121 • (^-l) 



E[|J|2] - E[|JP 



Note that 



E[|J|2] = ^E[Z„]+ ^ B[Z^Zu'] 

u&Ut u,u'GUt 

^E[|J|]+ Yl E[X„X„,] 
u,u'eUt 

= E[|J|]+ Yl Yl {'^[X^^u]'E[X^,^u'] + Cov[Xy^u,Xy,^u']) 

u,u'£Ut iv,u)eFu 






u,u'GUt \ \(d,«)GF„ / \(v',u')eF^, 

Uy^u' 



E\\J\] + 



E \^u\\Fu'\/5^ 

u,u'€Ut 

\ Uy^u' / 

\ \u£Ut / 



E\\J\] + 

E[\J\] + 0{\F\'/6'). 
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Here E[|J|] = Enec/.E[Z„] ^ E„6C/.E[X„] = T.ueuM\P^\l^) = 0{\F\I5). Using the 
condition |F| = 0(A) = Vl{6), we have E [| Jp] = O {\F\^/5'^). Substitute it in (gj]) and use 
the condition /\/5 = 0(1), and then ([5|) follows. ■ 



From here our analysis follows 24 1. 



Proof of Proposition \4^ The proof is divided into four phases, depending on the number of 
informed nodes |/t|. 

Phase 1: 1 ^ |/i| ^ l/c/). This phase is divided into several subphases. For every 1 ^ i ^ 
log{l/(j)), subphase i begins when the number of informed nodes is at least 2*~^ and ends when 
this number is at least 2*. Assume that we are at the beginning of the ith subphase. Fix an 
arbitrary round t of the ith subphase and the set of informed nodes It; thus, 2*^^ ^ \It\ < 2*. 
We consider the number of nodes that become informed in round t + 1. Applying Lemma [4.5l fT]) 
with F = E{It,Ut) gives 

Pr[|/i+i\Ii| ^ 1] ^cimin{e(Ii,?7i)/A,l} ^cimin{0(VA)|Ii|,l}, 

Let p = cimin{(/)(VA)|/i|,l}. We have p = 0{(j)\lt\) since \It\ ^ l/(j) and A/5 = 6(1). 
Therefore, the expected time to increase |/i| from 2^^^ to 2* is at most 2*~^/p = 0(1/0). By 
Markov's inequality, 

Pr[|/i+,| ^2^ I \It\^2'~'] ^1/2 

for some r = 0{4>~^). Hence the time to complete Phase 1 can be upper bounded by r = 
O((l/0)) multiplied with the sum of log(l/i;A) = O(logn) independent geometric random vari- 
ables each with parameter 1/2. Applying a Chernoff bound for the sum of independent geometric 
random variables yields that the number of rounds required for Phase 1 is at most 0{{l/(j)) -log n) 
with high probability. 

Phase 2: l/(f) ^ \It\ ^ n/2. Fix a round t and the set of informed nodes It- We apply 
Lemma l4.5l| 2]l. with F = E{It, Ut). Note that the precondition \F\ = J7(A) is satisfied, as 

|F| = e(Ii, Ut)^(l)-6-\It\^cl)-6- {A/6) ■ (1/0) = n{A). 

Hence we conclude from Lemma l4.5l| 2|) that 

Pr[\It+i\It\^C2-^-6-\It\/A];?C3, 

for some constant 02,03 > 0. When this event occurs, we have |/t+i| ^ (1 + 02(l)S/A)\It\. 
So, the number of rounds until we have \It\ ^ n/2 can be upper bounded by the sum of 
logi_|_c2(^5/A('^/2) = O((l/0)logn) independent geometric random variables with parameters 
C3. Using again the Chernoff bound we obtain that Phase 2 is completed within at most 
O((l/0)logn) rounds with high probability. 

Phase 3: n/2 ^ |/(| ^ n — l/(p. The analysis is the same as in Phase 2 with the roles of It 
and Ut switched. 

Phase 4: n — 1/0 ^ \It\ ^ n. Again, the analysis is the same as in Phase 1 with the roles 
of /( and Ut switched. 

Since each of the four phases requires only O((l/0) • log n) rounds with high probability, the 
result follows by applying the union bound. ■ 
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5 Protocol For Strong Expander Graphs 

We further study the protocols for strong expander graphs. Let C be the norniaUzed Laplacian 
matrix of G defined by 

1 \i u = V and deg(ti) / 0, 

Cy^^ = I -j^^^^=^^^^ if u and v are adjacent, 

Vdeg(u) • deg(t;) 

otherwise. 

i.e. £ = I - D^i/^AgD"^/^ Let the eigenvalues of £ be = Aq ^ Ai ^ . . . ^ A„_i. Moreover 
define A = maxjl — Ai, A„„i — 1}. We call a family of graphs {Gjjj^o strong expander graphs if 
A/ 5 = 1 + o(l) and A = o(l) for any Gj. This graph family includes several interesting graphs, 
e.g. Ramanujan graphs, complete graphs, random graphs G{n,p) with p = uj{logn/n), and 
random d-regular graph where d is any increasing function of n. Our result for strong expander 
graphs is as follows: 

Theorem 5.1. Let G be a graph such that A/5 = 1 + o(l) and A = o(l). Then there is a 
protocol using 0(logn • (log log n + log A)) random bits in total, so that with high probability 
every node in G gets informed in log n + In n + o(log n) rounds. 

The runtime in Theorem 15.11 matches the precise runtime for the truly random protocol [1 



16|, ll8|, and is known to be tight [161]. Moreover, our protocol uses 0(log n • (log log n + log A)) 
random bits in total, in contrast to 0(n log n) random bits in the truly random protocol, and 



0(log n) random bits in the previous best result [2j|. For complete graphs, we use 0(log n) 
random bits in total, in contrast to 0(n log n) random bits used in the quasirandom rumor 
spreading protocol [l9| . 

5.1 Protocol 

Our protocol follows the framework of the pseudorandom protocol in ^]. However, instead of 
using Nisan's generators, we use PRGs for combinatorial rectangles and pairwise independent 
generators, together with unbalanced expanders. The construction of our protocol implies that, 
in order to maintain the precise rumor spreading time as in the truly random protocol, we only 



need 0(loglogn + log A) random bits per round, in contrast to 0(log n) random bits in [24 1. 
The formal description of our protocol is as follows: 

Protocol 5 (Protocol for Strong Expander Graphs). Let e = A"®*-^-* be sufficiently small, 
e' = 2^v^°siog"-^ Qjid jj^ = 0((logn)/e) a power of 2. Pick the following objects: 

• An explicit {<,K,{1 - e^ / 4) D)- expander T : [n^] x [D] -^ \_\ie\D]\.^i\' where K = A, 
D= (l2g^)°^^^ an(iMo = --- = Mi5_i = M^max{L>,AOW}. 



• An explicit function Q = {Qi, . . . ,Qm) '■ {0,1}^ — s- [m]^'^ that is both a pairwise in- 
dependent generator and a PRG e' -fooling Air^iM j^, where i = 0(logm + logM + 
log(l/e') log log(l/e') log log log(l/£')) = 0(log log n + log A) . 

These two objects Q and T can be uniquely constructed from n'^ and A '^), and hence are known 
to every informed node. 

The initial node having the rumor chooses a random string (si,...,st) where every Si is 
of the form {xi,yi) G [D] x {0,1}^. This random string is appended with the rumor and sent 
to other nodes. Once one node gets the rumor, it gets the ID u. In the ith round, node u 
computes r = T{u,Xi) that is in [My], the uth copy of [M]. It then chooses the neighbor with 
index GriVi) mod deg(n) in its adjacency list to send the rumor. 
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Proposition 5.2. Assume that Protocol\^ finishes in T rounds. Then it uses 0{T ■ (loglogn + 
log A)) random bits in total. 

5.2 Analysis 

To relate the spectral expansion of G with the expansion property, we use the following expander 
mixing lemma for general graphs. 

Lemma 5.3 (Expander Mixing Lemma for General Graphs |10l]). Let G be a general graph. 
Then for any subset X and Y it holds that 



e(xr)-™'<^)-"'<^'' 



vol(G) 



Jvol(X) • vol(y) • vol(X) • vol(y) 
^ A- -^ 



vol(G) 

In order to prove Theorem I5.H it suffices to show the following lemma: 

Lemma 5.4. Let G be a graph that satisfies the preconditions of Theorem \5.1[ Then with high 
probability all the following statements hold: 

• Phase I Suppose 1 ^ \It\ ^ n/logn. Then there is t = logn + o(logn) such that {It+rl > 
n/logn. 

• Phase II Suppose n/logn ^ \It\ ^ n — n/logn. Then there is t = o(logn) such 
that |/f+r| > n — n/logn. 

• Phase III Suppose \It\ ^ n — n/logn. Then there is t = lnn + o(logn) such that {It+rl = 
n. 

Proof. For any round t and u € Ut, v £ It, let Xy^^ be the boolean random variable whose 
value is 1 iff V informs u in round t + 1. Note that F is a {^K, (1 — e'^/4)Z))-expander and hence 
a (2, (1 — e^/4)D)-expander. And ^ is a pairwise independent generator. Then we observe that 
the statements in Lemma 14.41 hold here as well by the same proof. 
Phase L By Lemma 15.31 we have 

vol{It)-vol{Ut) . vol(/i) • vol{Ut) 
e{It, Ut) ^ 77777 A • 



^(1-A) 



vol(G) vol(G) 

vol(/i) • (vol(G) - vol(/t)) 



^(l-A)-5-|/i| 1 



vol(G) 

A • \Lt\ 
nd 

Since A = o(l) and \It\ ^ n/logn, we have 

e(/i,C/i)^(l-o(l))-A.|/i|f^ 



^ A d- log n 
HcncG 

|„,„\,|,£(^, (,__!_ _„„),, I. 

Define 7 ^ A + j^, and A ^ {u G N{It) \ h : \N{u) n It\ ^ 2d^}. Then e{A,It) ^ 
\A\ ■ 2d ■ ^/7. On the other hand by Lemma 15.31 it holds that 

/ , ^ N VOI(A) ■ Vol(It) , / 77-77 TT-TT 

e{A, It) ^ \^^^^^ ^ " + AVvo1(A)-vo1(/,) 

A2 . 1^1 . \It\ 



^ ^^^ + 7A • y/\A\ ■ \It 

nd 
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By the definition of set A we have e{A, It) ^ 2(i^/7 • |j4|, and hence 



\A\-2d-^^ 



A^ • 1^1 • \It 
nd 



+ 7A Vl^l-I^t 



^i^ + o{i))-^^ + i^-VW\hi 



which implies 1^41 ^ 7 • \It\. 

Now define B = N{It) \It\A. We have 

e{BJt) = e{N{It),It)-e{A,It);? (l 



logn 



o(l)-7 A-|/t|. 



With the above estimate at hand, we compute the expected value of \Itr]B\. Note that for any 
u £ B, the chance that it gets informed in round t + 1 is 



Pt+i{u) = Pr 



V {X,^u = 1) 
veN{u)r\it 



which is lower bounded by 



J] Pt[X,^u = 1]- Y1 



Pr 



veN{u)nit 
by Bonferroni inequalities. Hence 



vi,V2GN{u)nIt 



/\ (X,^^„ = 1) 



1=1,2 



Pt+iiu)^ \N{u)nlt\ ( ^-e 



\N{u)nit\ 
2 



^ + ^ 



^ (1 _ 0(1)) . I^(^)^-^*I (i (1 + «(!)) • l^(^) ^ ^*l 



1 

A2 



^(1-0(1)) 



A 

\Niu)nit\ 

A 



2A 



(5.2) 



where the first inequality follows from Lemma [4. 41 and the fact that e = (1/A) *^^' is sufficiently 
small, and the last step uses the condition that \N{u) n It| ^ 2(i^/7 = o(A). Hence we have 



E[|/i+i\/i|]^E[|Ii+ini?|] = j;pt+i(n)^^(l-o(l))- 



ueB 



ueB 



\Niu)nit 
A 



= (1-0(1)) -^^^(1-0(1)) -I/,!. 

Since \It+i \ It\ ^ l-^tl; it follows by using Markov's inequality (applied to \It\ — \It+i \ It\) 
that Pr [ |/i+i| ^ (2 — f{n))\It\ ] ^ 1 — g{n), where f{n) and g{n) are both functions that tend 
to zero. Hence the time to reach \It\ ^ n/ log n can be upper bounded by the sum of log2_/(ra) n 
independent, identically distributed geometric random variables with expectation at most 1 — 
0(1) each. Using the Chernoff bound from Lemma lA.ll vields for r = log2 n + o(logn) that 
Pr[|/t+^| >n/logn] = 1 - o(l). 

Phase II \It\ £ [n/logn,n — n/logn]. We further divide this phase into the two cases |/j| G 
[n/logn,n/2] and \It\ € [n/2,n — n/logn]. We start with the first case |/i| S [n/logn,n/2]. 
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For any u G N[It) \ It, the probability pt+i{u) that u gets informed in round t + 1 is lowered 
bounded by 

(1 _ 0(1)) . \^i^)^it\ A (1 + 0(1)) • \N{u) n it\ 



A V 2A 

by the same argument as in (j5.2p . This is then lower bounded by 

\N{u)r\it\ 



(1-0(1)) 



2A 



since we have \N{u) r\It\ ^ A. 
By Lemma 15.31 '^s have 

vol(/0 • vol(^0 vol(/0-vol([/,) 
^(^*' ^*) ^ vol(G) ^ vol(G) 

= (l-o(l)).^|/,|. 

Similar to the analysis of Phase I, we can lower bound the expected number of nodes that 
become informed in round t + 1: 

E[|Im\/t|]^ E Pm(n)^ (1-0(1)) E \lMnlA 

u£N{It)\It ueN(It)\It 

= (l-o(l))^^^4|I.|. 

^ ^ '^^ 2A 8A' ' 

Since |/i+i| ^ 2|/(|, we obtain as long as \It\ ^ n/2 that there are constants a,/3 > so 
that Pr [|/t+i| ^ (1 + a)|/t|] ^ /3. Hence the time to reach |/t| ^ n/2 can be upper bounded 
by the sum of log]^_,_„(logn) independent, identically distributed geometric random variables 
with expectation at most 1//3 each. Using a Chernoff bound for the sum of geometric random 
variables (see Lemma lA.ip yields that with probability 1 — o(l), we reach \It\ ^ n/2 within at 
most o(logn) additional rounds. 

Consider now the case |/t| G [n/2,n — n/logn]. To analyze this case, we examine the 
shrinking of Ut = V\It. Note that for any u £ Ut, the probability pt+i{u) that u gets informed 
in round t + 1 is lowered bounded by 

(1 _ 0(1)) . I^(^)^-^*I (i (1 + «(!)) • l^(^) ^ ^*l 



A V 2A 

by the same argument as in ()5.2p . This is then lower bounded by 

\N{u)nit\ 



(1-0(1))- 

since we have \N(u) D It\ ^ A. 

Again, as \Ut\ ^ n/2, by Lemma 15.31 we have 



2A 



eiIt,Ut)^{l-o{l)).^-\Ut\. 



Let us now compute the expected number of uninformed nodes after one additional round: 

E[|f/ml]=E(l-^m(-))^l^t|-(l-o(l))E(^^^^iK^) 



ueUt ueUt 

\ t\ y ^ ^^ 2A V 8A/ ' ' 
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A simple inductive argument yields for any integer r that, 



so for r = loglogn/log(l/(l — g^-)) + w(l), where io{l) is an arbitrarily slow growing function, 
we have E [ If/i+T-l ] = o(n/ log n). Hence by Markov's inequality, Pr [ |{7i+T-| ^ n/logn] = o(l). 

Phase III \It\ G [n — n/logn,n]. Again, we analyze the shrinking of the set Ut- By 
Lemma [2.13l for at least (1 — e/2)-fraction of the choices of xt, it holds that the size of {T(v, xt) '■ 
V G N{u) n It} is at least (1 — e/2)\N{u) D It\- From now on fix xt such that this event occurs. 

For any u G Ut, we have 



Pr[u^It+i]=Pr 



/\ {X,,^u = 0) 



v&N{u)r\it 



Let F be a subset of N{u) n It of size (1 — £/2)\N[u) n It\ such that the map T{-,xt) is injective 
when restricted to F. Note that the function y i— )• {QT(v,xt){y)) ipp ^ [my^^ is a PRG e'- 
fooling A^r„i|F| 1^1. By Lemma [2.51 the function y i— )• {QT{v,xt){y) mod deg(v)) is a PRG 
{e' + |F|A/m)-fooling Ms,\f\ where S = W^(^f[^^^{'")\- 
Then we have 



Pr[n^/t+i]<Pr 



l\ {X,^u = 0) 



.iieF 



^ J] Pr [X^^„ = 0] + e' + |F|A/m 



v&F 






<|l-^+. 



{l-e/2)\N{u)nIt\ 



+ e' + A^/m, 



where the second inequality follows from the properties of PRGs for combinatorial rectangles, 
and the third inequality follows from using pairwise independent generators. Since e ^ ■^, a 
simple induction shows that 



for any k ^ 0. So we have 



Pr[u^It+i]^ (1 



A 



<I'-A 



(l-e/2)|Ar(n)n/t| 



+ (1 - e/2) ■ \N{u) nlt\-e + e' + A^/m 



(l-e/2)\N{u)nIt\ 



+ (l-e/2) -A-e + e' + A^/m. 



The bound above applies for any choice of xt such that the size of {T{v, xt) : v G N{u) n /(} is 
at least (1 — e/2)|A^(u) n It\. And the probability of choosing such xt is at least 1 — e/2. So for 
random X(, we have 



Pr[n^/i+i]^ 1 



1 

A 



(l-e/2)-\N{u)r\It\ 



+ (1 - e/2) ■ A-e + e' + A^/m + e/2 



^ll-^J +o(l), 
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where we use the fact that e = (1/A)® is sufficiently small, and m = G((logn)/e). 

By dSH) it holds that e(/t, Ut) ^ (1 - ^ - o(l)) • A|[/i|. Let A C [/^ be the set of nodes v 
for which \N{v) n It\ ^ (1 — ^/7/2) • A, where 7 = j— ^ + o(l). We assume for a contradiction 
that \A\ > 2^- \Ut\. Hence, 

e(/i,[/i) = J^|A^(«)nli|+ J^ |A^(«)n/iK|A|-(l-^/2)A + |?7i\^|A 

v&A veUt\A 



mA - \A\^A/2 < (1 - ^ - o(l)) • A\Ut 



which yields the desired contradiction. Hence \A\ ^ 2,^\Ut\. Now define B = Ut\A so that for 
each n G i?, |A^(t;)n/i| > (1 — -^/2)A and |i?| ^ {l — 2^)\Ut\- Using linearity of expectations, 

B[\Ut+i\]^^Pr[u^It+i] + ^Pr[u^It+i] 
ueB ueA 

// -I N {l-£/2)|iV{n)n/t| \ 

/ I N (l-£/2)|Af(n)n/i| 

=^E 1-A +oi\Ut\) + \A\ 

«e_B ^ ^ 

Using the inequalities that (1 — 1/A;) ^ e~^' for A; ^ 1, e^ ^ l + 2x for sufficiently small constant 
X > 0, and the condition that \N{u) n /i| ^ (1 — ^/j/2) ■ A for u £ B, we get 






ue-B M6-B 

= (l + o(l))-e-i-|C/t|. 

By induction, it follows that for any step r > 0, E [ |C/t+r| ] ^ ((1 + o{l)) ■ e~^y ■ \Ut\. We 
choose r = — log(i_(_o(i)Ve-i (?^) = lnn + o(logn) and obtain that E[|f7t+T-|] ^ (1/logn). So 

Pr[|?7i+,|^l]^E[|[/i+,|]^l/logn. ■ 
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A Useful Lemmas 

Lemma A.l. Fix any < p < 1 and let Xi, . . . , Xn be independent geometric random variables 
on N with Pr[Xi = k] = (1 - p)''-^p for every fe G N. Let X = ^ILi ^i^ and fi = BlX]. 
Then it holds for all f3 > that 



Pr[X^ (l + /3)^]^e 



-n/32/(2(l+/3)) 



B Existential Proof 

In this section we show that O(logn) random bits are sufficient in rumor spreading for many 
classes of graphs (e.g. complete graphs, strong expanders, graphs with good conductance, etc.) 
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if we do not care about the computational complexity. We will prove the following general 
statement: 

Lemma B.l. Let C be a class of graphs on n nodes with no multi-edges. Let T' = rP^^' be an 
upper bound of spreading time. Suppose the spreading time for any graph in C is at most T with 
probability p for fully-random push protocol. Then there exists a (non- explicit) function 

f:{0,lYx[n]x[T']x[A]-^[A] 

such that 

1. f{x,u,t,d) G [d] for all {x,u,t,d) G {0,1}^ x [n] x [T'] x [A]. 

2. ^ = max{loglog|C|,logn + logA + loglogA} + 21og(l/e) + 0(1). 

3. for X umform,ly chosen from {0, 1}^, the spreading time for any graph G € C is at most T 
with probability p — e if node u uses f{x,u,t,deg{u)) G [deg(ti)] as the index of its receiver 
in its adjacency list in round t. 

In particular, £ is bounded fey 21ogn + loglogn + 21og(l/e) + 0(l) since \C\ ^ 2" and A ^ n. 

Proof. Choose f{x,u,t,d) £ [d] independently and uniformly at random for each {x,u,t,d) G 
{0, 1}^ X [n] X [T'] X [A]. Fix a graph G G C, an initial node in [n]. For each node u in the 
graph of degree deg(n), there are deg(ti)! possible orders of neighbors of u in its adjacency list. 
We also fix the order for each node u. Observe that for any fixed x, the random variables 
f{x,u,t,deg{u)) for all pairs (n, t) are independent and uniformly distributed. Let I{x) be the 
indicator random variable that equals 1 if the spreading time of G is at most T when node u 
uses f{x,u,t,deg{u)) to decide its receiver in round t. Then Prj [/(x) = 1] ^ p for any x and 
hence Ej [/(x)] ^ p. Also note that /(x)'s are independent. By the Chernoff bound it holds 
that 



Pr/ 



2-'Y,I{x)-2-'Y.Ef[I{x)] 



^ e 



^2exp(-2"e74) 



So with probability at least 1-2 exp(-2^e^/4), we have E^; [ I{x) ] ^ E^; [ E/ [ I{x) ]]-e '^ p-e. 
By the union bound, the probability that E^^ [I{x) ] ^ p — e holds for all graphs in C, arbitrary 
neighboring list of nodes, and all start nodes is at least 

l-n|C|-(A!f •2exp(-2V/4) 

which is greater than zero for sufficiently large i = maxjlog log \G\ , log n + log A + log log A} + 
21og(l/e) + 0(l). So there exists one function / such that E^; [/(x) ] '^ p — e holds for all graphs 
in C, i.e. the spreading time for any graph G G C is at most T with probability p — e over the 
choices of x, if node u uses f{x,u,t,deg{u)) G [deg(it)] to choose its receiver in round t. ■ 

The same result also hold for pull protocols and push-pull protocols, and can be shown using 
similar arguments. 

The following result follows from Lemma IB. II directly. 

Corollary B.2 (Existential Result). Let Q = {Gn}n^i be a family of graphs such that for 
any Gn G Q with n nodes the truly random protocol finishes in T = n^^^' rounds with high 
probability. Then there is a protocol which finishes in T rounds with high probability and uses 
31ogn random bits in total. 
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C Lower Bounds on Randomness Complexity 

We address the randomness requirement of rumor spreading protocols. We first introduce the 
pull model, which is a symmetric version of the push model, and the formal description is as 
follows: In round t ^ 0, every node u that does not yet have the rumor selects a neighbor v 
uniformly at random and asks for the rumor, and gets the rumor if v received the rumor before. 
In the push-pull model, in every round t, every node u chooses a random neighbor to perform 
push if node u has the rumor, or perform pull if u has not received the rumor. 

We prove the following lower bound on the number of random bits needed for any protocol 
in the push-pull model: 

Theorem C.l. Let G he any graph with n nodes and sufficiently large minimum degree 5 = 
il(logn). Then any protocol in the push-pull model that is oblivious of the order of adjacency 
lists of G and informs at least half of the nodes of G inT rounds with nonzero probability has to 
use more than log 6 — logT — 2 random bits. In particular, 0(logn) random bits are necessary 
when 5 = 0(n) and T = 0{n^^'^) for some constant e > 0. 

Here we even allow the protocol access to the ID of the initial node and the structure of 
G, i.e., the sets of neighbors of nodes as unordered sets. All we assume is that the protocol is 
oblivious of the the order of the adjacency lists. 

Proof. Suppose V[G] = [n]. Let A be the maximum degree of G and s be the initial node. We 
first claim that there exists a subset of nodes S of size n/2 (for simplicity assume n is even) 
such that deg(n)/4 ^ |5 n N{u)\ ^ 3deg(ti)/4 for all u G [n\: If we pick a random subset S 
of size n/2, then for any fixed u the condition deg(ii)/4 ^ |S'n N{u)\ ^ 3deg(ii)/4 holds, by 
the Chernoff bound, with probability at least 1 — e~ '-^ > 1 — 1/n for 6 = il(logn) sufficiently 
large. The claim then follows by taking the union bound. Pick such a subset S with the claimed 
property. Note that [n] \ S has the same property. We may therefore assume s G 5 by swapping 
S and [n] \ S ii necessary. 

A protocol for G using i random bits in T rounds is uniquely characterized by a pair of 
functions 

/i,/2:{0,l}^xHx[r]x[A]^[A] 

satisfying fi{x,u,t,d), f2{x,u,t,d) G [d] for all {x,u,t,d) G {0,1}^ x [n] x [T] x [A], in the 
sense that given the random string x, node u chooses a neighbor with index fi{x,u,t,deg{u)) 
(resp. /2(x,ti, t, deg(n))) in its adjacency list to push (resp. pull) the message in round t if it is 
informed (resp. uninformed). For each u £ [n], define /„ C [n] as 

^ ^ ('{/i(x,n,t,deg(t/)):xG{0,l}^tG [T]} uGS 
" \{/2(x,n,t,deg(t/)):xG{0,l}^tG[r]} u ^ S. 

Assume to the contrary that £ ^ log 6 — log T — 2. Then the size of /„ is at most 2 • T ^ 6/4 ^ 
min{|S'n N{u)\, \{[n] \S)ri N{u)\} for each u £ [n]. So it is possible to order the adjacency list 
of each u G [n] such that the neighbors picked by u using index set /„ are all in S* n N{u) if 
u £ S, or in {[n] \ S) (1 N{u) if w G [n] \ S. Then in the rumor spreading process, nodes in S 
push messages only to those also in S, and nodes in [n] \ S pull messages only from those also 
in [n] \ S. As s G 5, the nodes in [n] \ S never get informed. ■ 

For the push model and the pull model we may drop the assumption that 6 = il(logn) is 
sufficiently large, and also simplify the proof. 

Theorem C.2. Let G be any graph with n nodes. Then any protocol in the push model that is 
oblivious of the order of adjacency lists of G and informs all the nodes of G in T rounds with 
nonzero probability has to use more than log{6 — 1) — logT random bits. 
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Proof. The protocol is now characterized by a single function /i describing how rumors are 
pushed. Define lu = {fi{x,u,t,deg{u)) : x G {0, lY,t G [T]} for each u G [n]. Pick v G [n]\{s}. 
Assume to the contrary that £ ^ log((5 — 1) — logT. Then the size of lu is at most 2^ • T ^ 
(5 — 1 ^ |A^(n) \ {f}| for each u G [n]. So it is possible to order the adjacency list of each u G [n] 
such that the neighbors picked by u using index set I„ are all in N{u) \ {v}. Then the node v 
never gets informed. ■ 

Theorem C.3. Let G he any graph with n nodes. Then any protocol in the pull model that is 
oblivious of the order of adjacency lists of G and informs more than one node of G inT rounds 
with nonzero probability has to use more than log{6 — 1) — logT random bits. 

Proof. The protocol is now characterized by a single function /2 describing how rumors are 
pulled. Define lu = {f2{x,u,t,deg{u)) : x G {0,1}^, t G [T]} for each u G [n]. Assume to the 
contrary that i ^ log(5 — 1) — logT. Then the size of /„ is at most 2 -T ^ 5 —1 ^ l-^('w) \ {s}| 
for each u G [n]. So it is possible to order the adjacency list of each u G [n] such that the 
neighbors picked by u using index set lu are all in N{u) \ {s}. Then the nodes in [n] \ {s} never 
get informed. ■ 

D Omitted Proofs from Section [3] 

Lemma [377] (from page ED. ||M||^ ^ ||M||i||M||oo. 

Proof. Choose u G M™ \ {0} such that ||uM||2 = ||u||2||M||2. Note that 

||uM||2 = (uM,uM) = (u,uMMT) ^ Ai(u,u) = Ai||u||2 

where Ai is the largest (absolute) eigenvalue of the symmetric matrix MM^. Let x be the 
eigenvector of MM^ associated with Ai. We have 

Ai||x||i = ||Aix||i = ||xMMT||i ^ ||xM||i||MT||i ^ ||x||i||M||i||MT||i = ||x||i||M||i||M||oo. 

So ||M||^ = ||uM||^/||u||^ ^ Ai ^ ||M||i||M||oo. ■ 

M MZ II IIZ/II IIZ ^ -■- ^11 IIJ-II II ^-"^ 

Lemma 13. 16! (from pageilSp. For any vector u G M" , let\i = u" +u be the (non- orthogonal) 
decomposition of u such that u" \\ tv n and u^ _L 1. Then the following statements hold: 

(a) Write u(D (g) D)-i/2 = ui + U2 such that ui || 1(D D)i/2 and U2 -L 1(D (g) D)V2. Then 
ull = ui(D (g) D)^/2 Q^^ y± ^ ^^(D D)i/2_ 

(b) ||ull||2 ^ (3\\u\\2 and {{u^h ^ /3||u||2. 

(c) // u G M" represents a probability distribution (i.e., all entries of u are non-negative and 
||u||i = 1), then u" = tt (g tt. 

Proof. For Statement (a), it is easy to verify that ui(D(g)D)^/^ || 7r(g)7r and U2(D(g)D)^/^ _L 1. 
The orthogonal decomposition of u(D ig) D)~^'^ into ui and U2 is unique, which implies the 
uniqueness in (a). 

Statement (b) follows from (a) as 

llu" II2 ^ ||ui||2||(D (g D)i/2||2 ^ 11^(13 ^ D)-i/2||2 . A ^ /3||u||2 

and similarly 

llu^lb ^ ||U2||2||(D D)i/2||2 ^ ||u(D D)-i/2||2 . A ^ /3||u||2. 

Statement (c) follows from the facts that 

(u", 1) = (ui, 1(D ® D)i/2^ = (u(D ® D)-^/2^ 1(D CS> D)^/^^ = 1 = (tt «> tt, 1) 

and u'l II TT TT. ■ 
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Lemma 13.171 (from page I15|) . For any vector u € M" parallel to tt (gi tt, it holds that 

||(uQ)^||2^/34n-i/2||u||2. 

Proof. Without loss of generality assume u = 1(D D). Let 

E = P - Mg O Mg. 
Then Q = (^^^) ® f ^±^) + f . By the definition of matrix E, we have 



E 



{u,v)(w,x) 



U ^ V, 

(Mg)(„,^) - (Mg ® M.G){.a,v)(w,x) U = V,W = X, 
^-{Mg0Mg)(u,v){w,x) U = V,Wy^X. 



Then for any w,x G V'iG], we have 

|(uE)(^,^)| ^ A2|(1Mg)^|+A2|(1(Mg»Mg))(^,^)| ^ A2||Mg||oo + A2||Mg0Mg||oo ^ 2/3A2. 

So ||uE||oo ^ 2/3 A^. We also have 

Y^ |(uE)(^,^)| ^ Yl ^^(Mg ® Mg)(u,u)(w,x) ^ nA^llMG MgIIi = nA^. 

w^x u,w,x 

So ||uE||i = Y^^ ^ |(uE)(^ .j.)| ^ n • 2/3 A^ + nA^. By Holder's inequality, we have 

||uE||^ = (uE,uE) ^ ||uE||i||uE||oo ^ (4^^ + 2/3)nA^. 

As Q = (i±|^) ® (J±|^) + f and 1(D D) is fixed by {^^) ® (^±^) , we have 
(uQ)-L = (uE)-L/4. Therefore 



||(uQ)^||2 ^ (/3/4)||uE||2 ^ (/3/4) V(4/32 + 2/3)nA4 ^ /32ni/2A2. 
Finally note that ||u||2 = ||1(D D)||2 ^ nJ^ and hence ||(uQ)-L||2 ^ /3^n"^/2||u||2. 



Lemma 13.181 (from page [15]). For any vector u G M" orthogonal to 1, it holds that uQ _L 1 
and 



uQD 



/-1/2 



^ (1 - a/8) 



uD'-V2 



Proof. Since Q is stochastic, we have (uQ, 1) = (u, IQ^) = (u, 1) = 0. To prove the second 
claim, we look at each summand in 

4Q = I(g)I + I(g)MG + MG(8)I + P 
individually. Clearly ||u(I (^ I)'D'~'^^'^\\2 = HuD'^^/^n^, Write I (^ Mg = M + M' such that 



M 



(«,ii)(ui,x) 



{I(g,MG)(^u,v){w,x) W = X 

w j^ X 



m; 



w = X 



{u,v){w,x) 



{I(g)MG)(u,v){w,x) Wy^X. 



Note that each row (u, v) of M contains at most one nonzero entry, namely the one at the column 
(n,n) whose value is at most 5~^. So ||]V[||i ^ 5~^. We also have ||M||oo ^ ||I (S" Mg||oo ^ /3, 
and hence ||M||2 ^ a/||M||i||M||oo ^ /3V2(^-i/2. Write 

u(I (S) Mg)D'-i/2 = u(I (S) Mg)(D (S) D)~^/2 _^ J. 
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Here 



r = u(I0MG)(D'" 



-l/2 



(D0D)-^/2) = uM(D'- 



1/2 



(D^D)-i/2), 



since the columns {w, x) of I (8) Mg — M = M' are zero if w = x, whereas the rows {u, v) of 

D'-i/2 _ (D (g) D)~^/^ are zero ifu^v. We have 

llrlb ^ llulbllMlh D'-i/2 _ (D (g D)~^/2 

II 11^ ^ 1 1 1 1 ^ 1 1 11^ \ / 

^ ||U||2 • (/3l/2rl/2) • r^ ^ /33/2rl/2 ||uD'-l/2 

Symmetrically, we have 

u(Mg (^ I)D'-^/2 ^ ^(M^ (g I)(D (g D)"^/2 _^ r' 
with ||r'||2 ^ /33/2j-i/2||uD'-i/2||2. Note that 

u(I Mg)(D ^ D)-^/2 ^ ^(M^ ® I)(D D)-^/2 ^ y(D ^ YtyV^d ^ Ng + Ng I). 

Recall that Ng has n normalized orthogonal eigenvectors vi, . . . , v„ in M" associated with n 
real eigenvalues l = Ai>l — a^A2^...^A„^a— 1 respectively, and vi is parallel to 
ID ' . Then I (8) Ng + Ng I has n^ normalized orthogonal eigenvectors Vj (8 Vj associated 
with eigenvalues Xj + Xi, i,j = 1, . . . , n. The largest eigenvalue of I ® Ng + Ng C5 I is 2, 
corresponding to the eigenvector vi (gi vi || 1(D (gi D)^'^, and the second largest (absolute) 
eigenvalue is bounded by 2 — a. As u ± 1, we have (u(D (g D)"^/^, 1(D (g D)^/^) = (u, 1) = 0, 
i.e., u(D (g D)-i/2 _L i(p (^ D)^/2_ Therefore 



u(I Mg)(D (g> D)"i/2 ^ ^(M^ I)(D ^ D)-i 



/2 



^ (2 - a) 



u(D0D)"^/2 



and hence 



u(I (g Mg)D'-^/2 + u(Mg I)D'~^/^ 



^ (2 - a) 



^ (2 - q) 



uD'-i/2 



uD'-i/2 



+ r 2 + r 2 



^ (2 - Q + 2/3^/2^-1/2^ 



uD'-i/2 



Finally we look at matrix P. By permutating the rows (resp. columns) of P, we assume its 
first n rows (resp. n columns) are indexed by the diagonal elements {{u,u) : u £ V[G]}. By 
definition, we have 

'Mg 

Ml M2^ 

where (Mi M2) are the last n^ — n rows of Mg tX" Mg (we permutate the rows and columns 
of Mg <g Mg in the same way as we did for P). We claim that ||uPD'~ ' II2 ^ (1 + 
2/3^(5-^' 2) ||uD'- ' II2 for any vector u € M" (not necessarily orthogonal to 1): Write u = 
(ui U2) where Ui G M" and U2 G M" "", consisting of entries indexed by {u,v), u = v and 
u ^ V respectively. Also write 



D(gD 



Di 
D2 



D' 



D'l 
D2 



where Di,D'i G M"^" and D2 G ]R{«'-«)x(n2-n)_ Yiam (Di)„^ 
and D'l = (5 • D. Then 

||uD'-l/2||2 = ||uiD'-^/'||2 + ||u2D 



(D(gD)(„^„)(„^„) = (deg(u))^ 



-1/2||2 
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and 



uPD' 1/2 



uiMgD'i ^/^ + U2M1D1 ^/^ U2M2D2 ^/^ 



uiMgDi 



/-1/2 



+ 



U2M1D1 



-1/2 



+ 



U2M2D2 



/-1/2 



+ 2(uiMgD; '/',U2MiD; '/' 



(D.l) 



?-i 



uiMgD 



-1/2 



+ 



(0 U2)PD'-V2 



+ 2(5"i/2 (uiMgD^^/^ usMiD'i" 



-1/2 



We bound the first term by 



5-' 



uiMgD 



-1/2 



5-1 uiD^i/^Ng 



^(5 



-1 



U1D-V2 



uiD^ 



(D.2) 



and the inequahty in the middle holds since the largest (absolute) eigenvalue of Nq is 1. 
For the second term, write 

(0 U2) PD'-i/2 ^ (0 U2)P(D®D)-i/2^f 

where f = (O U2) P(D'"i/2 - (D (g) D)"^/^). As P and Mg Mq disagree only on rows (n, v) 
for u = V, we have 

(0 U2)P(D0D)-i/2^ (0 U2)(Mg0Mg)(D^D)-i/2^ (0 U2)(D0D)-i/2(N^^p^^)_ 
Therefore 



(0 U2)P(D^D)-i/2| 



< 



[0 U2)(D®D)-i/2(Ng®Ng) 
[0 U2)(D0D)-i/2 



U2D: 



-1/2 



Then we bound llf lU- Note that 



(0 U2)P(D'-i/2-(D®D)-i/2) ^= U2Mi(D'-^/'-D-^/2) 

We have ||Mi||oo ^ HMg^MgHoo ^ /5^- To bound ||Mi||i, observe that ||Mi||i is by definition 
the maximum of the ^i-norm of rows of Mi. Also note that G x G has no multi-edge and 
therefore a nonzero entry in the [u, t;)-th row of Mg tX" Mg = (D (g) D)~^Aqxg (and hence also 
Ml) has value l/(deg(u)deg(f)). So 

llMilli = max > l/(deg(tt)deg(f)) = max > l/(deg(tt)deg(f)). 

u^v {■w,w)£N{{u,v)) ^_^„ wGN{u)nN{v) 

For any u,v €z V[G], we have \N{u) D N{v)\ ^ min{deg(u), deg(t;)}, which implies ||Mi||i ^ 
min{l/deg(n),l/deg(t;)} ^ S'^. We then have ||Mi||^ ^ ||Mi||i||Mi||oo ^ P^6^^. Then 



|f||2 ^ IIU2II2IIM1II2 



j3/-l/2_j3-l/2 



which is bounded by 

IIU2II2IIM1II2 



D 



/-1/2 



^||u2||2-(/3rl/2). 5-1^/3^1/2 



U2D; 



'1/2 



(D.3) 
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< 



U2D2 



Therefore 

(0 U2)PD'-i/2 

Then we bound the last term 

2ri/2 ^u^MgD-V2,u2MiD'-^/') ^ 25-1/2 

^ 2/325-1/2 
^ /325-1/2 (" 



U2)P(D®D)-i/2 



+ l|f|| 



(D.4) 



uiMgD-i/2 
uiMgD-i/2 



U2M1D; 1/2 

U2II2IIM1II2 



D 



/-1/2 



uiD'i 



-1/2 



U2D2 



(D.5) 



uiD'i 



/-1/2 



+ 



U2D: 



-1/2 



where we use (|D.2p and ()D.3p in the third step. 

Combining (|Dl^ . (|E2]), (ID^ and (|D3]) . we obtain 



|UPD'-1/2||2 ^ fl + /325-l/2 



uiD^ 



+ ((l + ;325-l/2)2+/325-l 



/2 



U2D^ 



-1/2 



^ f 1 + 2/325-1/2 



1 + 2/325-1/2 



^/-l/2 



uD'-i/2 



+ 



U2D 



-1/2 



and hence 



uPD'-i/2 ^(1 + 2/325-1/2) uD'-i/2 



as claimed. 

Combining all the results above, we have 



uQD' 



-1/2 



< 



u(I (g) I)D'-i/2 + u(I (g) Mg)D'-i/2 + u(Mg «) I)D'-i/2 



+ 



uPD' 1/2 



(1 + (2 - a + 2/33/25-1/2) + (1 + 2/325-1/2)) 



^ 



uD'-i/2 



^ (1 - a/8) 



uD'-i/2 



where we use the assumption a ^ 8A 5 



2A-2.5 



Lemma 13.191 (from page I16p . Let u represent a distribution over V[G] x ^[G], and suppose 
matrix M is a convex combination of products 0/ 1 (g> I, I (g Mg, Mg I and P. Then for any 
k €N it holds that 

ri(uQ^M) -77 ^ (1 - 0/2)^/31/2, 



and 



r2(uQ^M) - TT ^ (1 - 0/2)^1/2. 
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Proof. We prove the claim for 7i(uQ M) and the case for 72 is symmetric. As one can check, 
TKuQ'^M) = u'((Mg + I)/2)''M' G M" where u' = 71 (u) G M" and M' is a convex combination 
of powers of Mc- Write u' = u'^^ + Ug such that u'^ || tt and Ug -L 1. Then u'^ = tt is fixed by 
((Mg + I)/2)^M'. So ri(uQ'=M) - TT = u'2((Mg + I)/2)'=M'. 

It remains to show ||u'2((Mg + I)/2)''M'||2 ^ {I- a/2)'' (3^/^. Let N = D^/^^'D-^/^ ^j^^pj^ 
is a convex combination of powers of Ng- As U2D~ ' _L ID ' , the matrix ((Ng + I)/2) Ng 
reduces the ^2-iiorm of U2D^ ' by at least a factor of (1 — a/2)~'' by the spectral expansion 
ofC Then 



u'2((Mg + I)/2)'^M' 



^ (1 - a/2)'' u'2D-i/2 Di/2 
^ (1 - a/2)^' u'D-i/2 Di/2 

2 

^(l-a/2)V'/'||u||2 
^ (1 - a/2)V^/^. 

The third step uses the fact that u'^D~ ' and UgD" ' are orthogonal. The last step uses the 
fact that ||u'||2 ^ ||u'||i = 1. ■ 



Lemma 13.201 (from page [16]). Let u represent a distribution over V[G] x V^[G]. Then for any 
k gN it holds that 



uQ^ 



TT (X> TT 



^ 8a- V^n~^/^ + (1 - a/8)'' 13'^ 



Proof. Note that (uQ^)l' = tt tt and hence 



UQ'^ - TT TT = (uQ'^) 



So it suffices to show 



Induct on k. The case A; = is trivial. For k > 0, assume the claim holds for k' < k. Let 
V = uQ - . Then v represents a distribution since Q " is stochastic. Write v = v" + v . 
Then v" = tt (g) tt, and 



.11 



E«ey[G]deg^(^^) 



^rn 



2^-1 



2 (E«eV[G] deg(n))2 
We have 

(uQ'=)^D'-V2 = (vQ)^D'-V2 = (vIIq)^D'-V2 + (v^Q)^D'-V2^ 

By Lemma 13.171 



llo^^ 



(viiQ) 



i^/3^n 



'4„-i/2 



^P^'n 



6„-3/2 



and hence 



By Lemma 13.181 

v^QD'-i/2 



(vIIQ)^D'-i/2 ^/S^r^n-^/^. 



^(l-a/8) v^D'-i/2 =(i_a/8) (uQ^-i)^D'-i/2 
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which is bounded by (1 — a/8) • 8a ^P^5 ^n ^' ^ + (1 — a/8)'' 136 ^ by the induction hypothesis. 
Then 



uQ^)^D'V2 ^ (vIIQ)^D'1/2 



+ 



v^QD'V2 

^ /36r in-3/2 + (1 - a/8) • Sa"^ p^S~^n-^/^ + (1 - a/8)''P6-^ 
^ 8a-^P^5-^n-^/^ + (1 - a/8)'=/3r^ 



as desired. 



Lemma 13.211 (from page [T6|) . Let u represent a distribution over V[G] x ^[G]. And suppose 
matrix M satisfies 



M=rM£±i«i 



or M = I 



Mg + I 
2 '^'J "' '" \'^ 2 

for some k' G N. Then for A; E N, it holds that 

IIuQ'^M - TT (g)7r||2 

^ (1 - a/2)V^/^ + (1 - a/2f ■ 8a~^P^n-^/^ + (1 - a/2)^'(l - a/8)V^- 



Proof. We prove the claim for the case M = ( — 0— 



and the other one is symmetric. 
Note that 

(uQ'')IIM = (tT (g) 7r)M = TT (g) TT 

and hence uQ'' -7r(g)7r = (uQ'')-LM. Write uQ''(D(gD)-i/2 ^ ui + U2 where ui || l(D(g)D)^/2 
and U2 ± 1(D D)V2. Then (uQ'')-^ = U2(D (g) 0)^/2 by definition. Therefore 

k' 



(\ k' 



U2(^^;^®l] (D®D)V2. 



Recall that N^ has n normalized orthogonal eigenvectors vi, . . . , v„ in M" associated with 

n real eigenvalues l = Ai>l — a^A2^...^A„^a — 1 respectively, and vi || ID ' . Then 

2 ® I has n^ normalized orthogonal eigenvectors Vj (g Vj associated with eigenvalues 2 ; 

i,j = l,...,n. Write U2 = wi + W2 such that wi £ V = span(vi g) vi, . . . , vi (g v„) and W2 is 

in the orthogonal complement of V. Then wi = (ID ' ) (g w' for some w' £ M". Then 



IW1II2 = (wi,Viri) = (U2,W1 



(uQ'=)^(D D)-i/2^ (lDi/2) w' 
(uQ*^)^,l®(w'D-i/2 

ri((uQ'=)^),w'D-i/2 



< 



/J3-1/2 



ri((uQ^)^) 

z 

< (1 - a/2)^-/3i/2 w'D-1/2 

^{1- a/2)'' 13^/^6-^ (lDi/2)»w' 

= (l-a/2)V'/'ri||wi||2 

where the second inequality follows from Lemma 13.191 So ||wi||2 ^ (1 — a/2) /3^'^5~^. The 
"^ (g 1 1 fixes w"! and reduces the ^2-iiorm of W2 to at most (1 — a/2) ||w2||2 ^ 
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(1 — a/2)^ IIU2II2, since W2 is in the span of eigenvectors Vj Vj, i 7^ 1, that have eigenvalues 
^ ^ 1 - a/2. Then 



U2 I z ® I 



(D®D)i/2 



||uQ''M-7r(g)7r||2 

^ (IIW1II2 + (1 - a/2)'^'||u2||2) ||(D D)i/2 

^ ||wi||2-A + (l-a/2)'='/3 U2(D(»D)i/2 

^ (1 - q/2)'=/33/2 + (1 - a/2)'^'/3 (uQ^)^ 

^ (1 - aj^ffl^ + (1 - a/2)^'/3 • (80"^^?^"^/^ + (1 - a/S)''/?^) 

= (1 - aj^ffl'^ + (1 - a/2)'^' • 8a-V^n-3/2 + (l _ ajlf {\ - aj^ff 

where we use the fact that ||(uQ'')-L||2 = ||uQ'' - -k ® 7r||2 ^ Sa'^/S'^n-^/^ + (1 - a/S)'^/?^ by 
Lemma [3:201 ■ 
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